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ABSTRACT 


Title  of  Dissertation:  DYNAMICS  OF  RANDOM  EARLY  DETECTION 

GATEWAY  UNDER  A  LARGE  NUMBER  OF 
TCP  FLOWS 

Peerapol  Tinnakornsrisuphap,  Doctor  of  Philosophy,  2004 

Dissertation  directed  by:  Professor  Armand  M.  Makowski 

Department  of  Electrical  and  Computer  Engineering 


While  active  queue  management  (AQM)  mechanisms  such  as  Random  Early 
Detection  (RED)  are  widely  deployed  in  the  Internet,  they  are  rarely  utilized  or 
otherwise  poorly  configured.  The  problem  stems  from  a  lack  of  a  tractable 
analytical  framework  which  captures  the  interaction  between  the  TCP 
congestion-control  and  AQM  mechanisms.  Traditional  TCP  traffic  modeling  has 
focused  on  “micro-scale”  modeling  of  TCP,  i.e.,  detailed  modeling  of  a  single 
TCP  flow.  While  micro-scale  models  of  TCP  are  suitable  for  understanding  the 
precise  behavior  of  an  individual  flow,  they  are  not  well  suited  to  the  situation 
where  a  large  number  of  TCP  flows  interact  with  each  other  as  is  the  case  in 


realistic  networks. 


In  this  dissertation,  an  innovative  approach  to  TCP  traffic  modeling  is 
proposed  by  considering  the  regime  where  the  number  of  TCP  flows  competing 
for  the  bandwidth  in  the  bottleneck  RED  gateway  is  large.  In  the  limit,  the 
queue  size  and  the  aggregate  TCP  traffic  can  be  approximated  by  simple 
recursions  which  are  independent  of  the  number  of  flows.  The  limiting  model  is 
therefore  scalable  as  it  does  not  suffer  from  the  state  space  explosion.  The 
steady-state  queue  length  and  window  distribution  can  be  evaluated  from 
well-known  TCP  models. 

We  also  extend  the  analysis  to  a  more  realistic  model  which  incorporates 
session-level  dynamics  and  heterogeneous  round-trip  delays.  Typically,  ad-hoc 
assumptions  are  required  to  make  the  analysis  for  models  with  session-level 
dynamics  tractable  under  a  certain  regime.  In  contrast,  our  limiting  model 
derived  here  is  compatible  with  other  previously  proposed  models  in  their 
respective  regime  without  having  to  rely  on  ad-hoc  assumptions.  The 
contributions  from  these  additional  layers  of  dynamics  to  the  asymptotic  queue 
are  now  crisply  revealed  through  the  limit  theorems.  Under  mild  assumptions,  we 
show  that  the  steady-state  queue  size  depends  on  the  hie  size  and  round-trip 
delay  only  through  their  mean  values. 

We  obtain  more  accurate  description  of  the  queue  dynamics  by  means  of  a 
Central  Limit  analysis  which  identifies  an  interesting  relationship  between  the 
queue  fluctuations  and  the  random  packet  marking  mechanism  in  AQM.  The 
analysis  also  reveals  the  dependency  of  the  magnitude  of  the  queue  fluctuations 
on  the  variability  of  the  hie  size  and  round-trip  delay.  Simulation  results 
supporting  conclusions  drawn  from  the  limit  theorems  are  also  presented. 
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Chapter  1 


Introduction 

1.1  Motivation  and  Goals 

One  of  the  key  mechanisms  for  the  operation  of  the  best-effort  service  Internet  is 
the  congestion-control  mechanism  in  the  transmission  control  protocol  (TCP) 

[26].  While  there  are  several  variations  to  the  basic  TCP  congestion-control 
mechanism,  they  all  have  in  common  the  additive  increase/multiplicative  decrease 
(AIMD)  algorithm.  This  AIMD  algorithm  enables  TCP  congestion-control  to  be 
robust  under  diverse  conditions.  Unfortunately,  the  self-clocking  feedback 
mechanism  of  the  AIMD  algorithm  does  introduce  some  additional  complexities 
into  the  behavior  of  network  traffic.  There  has  been  a  number  of  studies  trying  to 
gain  insights  into  this  complex  behavior.  At  the  present,  the  relationship  between 
the  throughput  of  a  single  TCP  flow  and  its  round-trip  and  loss  probability  is 
fairly  well  understood  [2,  40,  43,  48]. 

There  are,  however,  certain  aspects  of  TCP  that  are  not  well  understood  and 
which  cannot  be  analyzed  with  the  techniques  in  the  aforementioned  studies. 

This  includes  issues  such  as  buffer  behavior  at  a  bottleneck  router  and  the 
aggregate  throughput  when  many  TCP  flows  compete  for  the  bandwidth  of  a 
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link.  While  one  could  in  principle  extend  the  aforementioned  single  flow  models 
to  attempt  to  answering  these  questions,  the  resulting  models  (which  will  be 
referred  to  later  as  micro-scale  models )  are  not  scalable.  More  specifically,  since 
each  flow  is  modeled  in  great  details,  the  size  of  the  state  space  for  such  models 
explodes  when  the  number  of  flows  becomes  large.  Consequently,  not  only  is  the 
analysis  becoming  intractable,  even  numerical  calculations  or  simulations  of  such 
models  are  very  complicated,  computationally  prohibitive,  and  would  not  provide 
additional  advantages  over  full-scale  simulation  with  existing  simulation  packages 
(e.g.,  NS  [45]).  While  one  could  make  certain  assumptions  to  simplify  the 
analysis,  it  is  not  clear  what  are  the  irrelevant  details  that  can  be  omitted  while 
still  providing  a  reasonably  accurate  analysis. 

To  make  matters  worse,  recent  developments  in  Active  Queue  Management 
(AQM)  techniques  have  introduced  additional  complexity  in  transport  protocols. 
The  development  of  AQM  originated  from  a  well-known  fact  that  with  simple 
Tail-Drop  gateways,  the  AIMD  mechanism  in  TCP  congestion-control  leads  to 
undesirable  behavior,  i.e.,  global  synchronization.  When  several  TCP  flows 
compete  for  bandwidth  in  a  Tail-Drop  gateway,  it  has  been  observed 
experimentally  that  packets  from  many  flows  are  usually  discarded 
simultaneously  [63],  resulting  in  a  poor  utilization  of  the  network.  AQM 
algorithms  such  as  Random  Early  Detection  (RED)  [18]  and  Explicit  Congestion 
Notification  (ECN)  [15]  have  been  proposed  to  help  alleviate  this  problem  by 
randomly  dropping/marking  packets  depending  on  queue  size,  thereby  avoiding 
heavy  congestion  and  preventing  global  synchronization.  As  can  easily  be 
imagined,  the  introduction  of  AQM  further  exacerbates  the  difficulty  of 
understanding  issues  associated  with  buffer  behavior  and  aggregate  TCP  traffic. 
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There  have  been  attempts  to  model  the  interactions  between  TCP  and  AQM, 
but  so  far  the  analytically  tractable  models  are  either  too  crude  or  too  simplistic 
(as  to  be  discussed  in  the  literature  survey  in  Chapter  2). 

Furthermore,  Internet  TCP  traffic  is  composed  of  connections  with  diverse 
session  dynamics  and  round-trip  delays.  The  session  dynamic  of  a  flow  depends 
on  the  type  of  applications.  For  example,  applications  such  as  FTP  and  Telnet, 
are  relatively  long-lived,  while  other  types  of  applications  are  typically 
short-lived,  e.g.,  web  browsing.  Additional  modeling  difficulties  also  arise  from 
the  fact  that  the  information  (i.e.,  marks  on  packets)  from  AQM  to  TCP  flows  is 
exchanged  at  different  feedback  rates  depending  on  the  round-trip  delay  of  the 
flows.  All  of  these  features  create  major  obstacles  in  deriving  a  scalable  model 
which  can  cope  with  a  large  number  of  flows  and  still  yields  analytical  insights 
into  how  to  control  such  traffic.  As  a  result,  little  work  has  been  done  on  the 
problem  of  modeling  the  interaction  between  AQM  and  a  large  number  of  TCP 
flows  with  session  dynamics  and  variable  round-trip  delays. 

The  problem  of  modeling  network  traffic  and  of  understanding  its  impact  on 
various  performance  metrics  is  not  unique  to  TCP  networks.  Early  traffic 
modeling  efforts  in  telephony  revealed  the  suitability  of  Poisson  processes  to 
model  the  time  patterns  in  the  stream  of  call  requests  to  a  telephone  exchange. 
This  culminated  with  the  pioneering  work  of  A.K.  Erlang  who  proposed  the 
M\M\c\c  model  for  dimensioning  call  systems  [11].  Similarly,  it  is  well-known 
that  the  multiplexed  packet  traffic  generated  by  a  large  number  of  bursty  data 
sources  is  well  described  by  a  Poisson  process.  It  turns  out  in  retrospect  that  this 
so-called  “Poisson  modeling”  is  easily  justified  by  a  celebrated  limit  theorem  due 
to  Palm  [50]  and  Khintchine  [33].  According  to  these  results,  under  very  weak 
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technical  conditions,  the  superposition  of  point  processes  is  a  point  process  which 
converges  to  a  Poisson  (point)  process  when  the  number  of  superposed  point 
processes  grows  large. 

These  success  stories  point  to  the  possibility  of  systematically  applying  limit 
theorems  in  order  to  derive  traffic  models  (which  will  be  referred  later  as 
macro-scale  models).  The  advantages  of  doing  so  are  three-fold.  First,  model 
simplification  typically  occurs  when  applying  limit  theorems,  thereby  filtering  out 
irrelevant  details  without  relying  on  ad-hoc  assumptions.  Second,  since  limit 
theorems  are  central  to  the  Theory  of  Probability,  it  is  reasonable  to  expect  the 
existence  of  suitable  limit  theorems  (with  very  weak  assumptions)  which  can  be 
applied  to  the  traffic  model  of  interest.  Finally,  the  resource  allocation  problem  is 
interesting  only  in  networks  operating  at  high  utilization  such  as  when  the 
number  of  users  is  large.  In  such  a  scenario,  the  limit  behavior  becomes  more 
accurate  as  the  number  of  users  increases. 

We  would  like  to  again  stress  that  the  development  of  a  tractable  analytical 
framework  for  the  interaction  between  RED  (and  more  generally  AQM)  and  TCP 
is  important  in  practice  as  well.  While  AQM  mechanisms  such  as  ECN  and  RED 
are  widely  implemented  and  supported  in  the  new  generation  of  routers,  they  are 
either  rarely  utilized  or  otherwise  poorly  configured  due  to  the  lack  of  an 
understanding  of  how  to  configure  their  parameters.  In  fact,  May  et  al.  [41] 
advise  against  the  deployment  of  RED  in  spite  of  its  benefits  because  a 
misconfigured  RED  can  have  detrimental  effects  on  the  network  traffic.  It  is  the 
objective  of  this  thesis  to  show  that  simple  and  robust  models  can  be  derived  in  a 
systematic  manner  through  limit  theorems,  and  that  such  models  can  help 
improve  the  understanding  of  the  dynamics  and  encourage  the  deployment  of 
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well-engineered  AQM  mechanisms. 


1.2  The  Approach 

As  part  of  this  program,  we  consider  several  discrete-time  models  of  a  bottleneck 
RED  gateway  with  a  large  number  of  TCP  flows  competing  for  bandwidth 
resources.  In  this  thesis,  we  mainly  consider  RED  for  the  AQM  mechanism 
because  it  is  the  simplest  and  most  widely-deployed  AQM  mechanism.  However, 
many  of  the  conclusions  drawn  from  the  analytical  results  are  applicable  to 
models  combining  generic  window-based  congestion- control  mechanisms  and 
AQM  schemes.  Furthermore,  the  approach  can  be  generalized  to  apply  to  a  large 
class  of  models. 

All  the  models  considered  here  take  into  account  detailed  packet-level 
operations  unlike  other  existing  models  in  the  literature  (to  be  surveyed  in 
Section  2.3),  where  the  packet-level  operations  of  TCP  are  ignored  and  only  the 
evolution  of  the  transmission  rate  is  considered.  It  is  important  to  make  this 
distinction  because  mechanisms  in  TCP  and  AQM  rely  on  packet-level 
operations.  The  omission  of  this  level  of  details  simplifies  the  analysis  but  at  the 
expense  of  additional  distortion  in  the  results. 

The  models  under  investigation  in  this  dissertation  are  organized  around  the 
behavior  of  the  RED  buffer  contents  observed  at  discrete  epochs  (which  are 
indexed  by  t  —  0, 1, . . .).  They  differ  through  the  specific  mechanism  used  for 
generating  incoming  packets  in  response  to  the  congestion  that  develops  in  the 
RED  buffer  shared  by  the  TCP  flows.  Specifically,  in  the  initial  model  (to  be 
referred  to  as  the  rate-based  model  in  Chapter  3)  the  packet  transmission  rate  of 
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a  source  increases/decreases  in  response  to  random  packet  dropping  at  the  RED 
gateway.  This  simple  ersatz  model  is  easy  to  understand  and  provides  a  complete 
conceptual  picture  of  the  system  and  the  type  of  the  results  to  be  expected. 

Later,  we  embellish  this  model  by  incorporating  a  more  accurate  additive 
increase/multiplicative  decrease  (AIMD)  window  mechanism  of  TCP.  The  AIMD 
algorithm  controls  the  window  size  in  response  to  the  random  marks  from  the 
bottleneck  ECN /RED  gateway.  This  model  is  introduced  in  Chapter  4,  and  we 
later  refer  to  it  as  the  window-based  model. 

While  the  models  in  Chapter  3  and  4  shed  some  insights  on  the  interaction 
between  the  AIMD  algorithm  and  RED,  they  do  not  represent  a  realistic 
depiction  of  the  actual  Internet  because  they  ignore  the  session-level  dynamics, 
i.e.,  arrivals  and  terminations  of  flows,  and  the  variability  of  round-trip  delays. 
These  additional  layers  of  dynamics  will  be  incorporated  into  the  model  in 
Chapter  5. 

These  models  have  common  characteristics  when  taking  the  limit  as  the 
number  of  flows  grows  to  infinity.  First,  properly  normalized  aggregated 
quantities  ( e.g queue  size  and  throughput)  converge,  and  simplified  recursive 
dynamics  emerge  for  the  limiting  quantities.  Second,  it  is  reasonable  to  expect 
the  throughput  of  a  flow  to  be  asymptotically  independent  of  other  flows  since 
the  effect  of  any  single  flow  on  other  flows  becomes  less  pronounced  as  the 
number  of  flows  grows  large. 

Here  we  identify  the  necessary  ingredients  for  applying  a  limiting  process  to 
TCP  model  as  done  in  this  thesis:  First  of  all,  only  the  congestion  in  a  bottleneck 
link  is  considered  with  capacity  of  the  bottleneck  link  and  the  feedback 
information  rate  (from  AQM)  scaling  with  the  number  of  users.  For  example,  if 
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C  denotes  the  capacity  per  user  of  at  the  bottleneck  router,  then  its  overall 
capacity  should  scale  up  to  NC  when  N  TCP  flows  share  the  bottleneck  link. 
Also,  let  f^^x)  :  M+  — >  [0, 1]  denote  the  feedback  dropping/marking  probability 
function  in  RED  when  the  buffer  level  is  x  and  there  are  N  users  in  the  system, 
i.e.,  any  incoming  packets  when  the  buffer  level  is  x  is  randomly  marked/dropped 
with  probability  f(N\x).  Then  /(iV^(a;)  should  scale  with  N  in  the  sense  that 
there  exists  a  continuous  function  /  :  M+  — >  [0,1]  such  that  for  each  N  =  1,2,..., 

f^N\x)  =  /(lV_1a;),  x  >  0.  (1.1) 

We  are  interested  in  the  “snapshot”  of  the  dynamics  when  there  exists  N  flows  in 
the  system.  Therefore,  /  is  just  a  surrogate  function  representing  the  average 
contribution  that  each  flow  has  on  the  dropping/marking  probability. 

Next,  in  order  to  describe  the  interaction  between  the  bottleneck  router  and 
TCP  flows,  all  models  use  a  recursion  similar  to  Lindley’s  recursion  for  the 
quantity  of  interest.  In  our  study,  this  quantity  is  the  queue  size  at  the  RED 
buffer.  Let  Q^N\t)  denote  the  queue  size  at  the  discrete  time  epoch  t  when  the 
system  has  N  flows.  During  the  timeslot  [t,t  +  1),  it  is  possible  for  queue  size  to 
increase  (due  to  new  incoming  packets)  or  decrease  (because  the  packets  in  the 
queue  are  serviced/transmitted).  We  denote  the  amount  of  increase  and  decrease 
in  epoch  t  as  A^N\t)  and  C^N\t),  respectively.  As  the  queue  is  always 
non-negative,  we  can  write  the  following  recursion 

Q(JV)  (t  +  1)  =  [Q(iV)  (t)  +  AW (t)  -  C{N) (t)] +  .  (1.2) 

Both  A(N)(t)  and  C^N\t)  follow  some  stochastic  recursions  which  depend  on  the 
current  queue  size  and  thereby  forming  a  stochastic  feedback  system.  For 
example,  when  Q(iV^(t)  is  large,  A^N\t')  —  C'^7V-)(f/)  should  be  negative  with  high 
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probability  for  some  t'  >t  in  order  to  drive  the  queue  size  down. 
If  we  normalize  (1.2)  by  N,  the  resulting  recursion  becomes 


Q^N\t  +  1) 


N 


Q(N\t)  A^N\t)  C(N\t) 


(1.3) 


N  N  N 

If  Q{  ^ ,  A{  ^  and  ^  all  converge  in  the  same  sense,  e.g.,  in  probability, 
as  N  grows  large  to  random  variables  (rvs)  q(t),a(t )  and  c(f),  respectively,  then 
also  converges,  say  to  a  rv  q(t  +  1),  i.e., 

QW(t  +  1) 


N 


>N  q(t  +  1) 


(1.4) 


with 


1)  =  [q(t)  +  a(t)  -  c(t)]+,  (1.5) 

P 

where  — denotes  convergence  in  probability  when  N  tends  to  infinity.  This 
produces  a  recursion  for  the  rvs  {q(t),  t  —  0, 1, . . .}  with  initial  condition  g(0) 
and  driving  sequence  (a(t),  c(t),  t  —  0, 1, . . .}.  The  model  simplification  typically 
results  from  the  fact  that  the  limiting  rvs  a(t )  and  c{t)  are  either  deterministic 
quantities  or  rvs  which  are  easy  to  evaluate,  while  in  the  model  with  arbitrary 
number  of  flows  N,  we  typically  need  N  or  more  state  variables  to  evaluate  either 
AW(t)  or  CW(t).  The  original  quantity  Q^N\t)  can  be  estimated  by  N  ■  q(t). 
Since  q(t )  can  be  evaluated  independently  of  N,  the  estimation  can  be  done 
efficiently,  and  the  limiting  model  is  scalable. 

To  provide  a  sharper  approximation  of  Q(N\t),  we  can  seek  to  establish  the 
following  convergence 


for  some  rv  Lo(t)  where  convergence  in  distribution  is  denoted  by  when  N 

tends  to  infinity.  If  there  exists  such  a  rv  Lo(t),  then  (1.6)  yields  the 
(distributional)  approximation 

QW(t)nN-q(t)  +  y/NL0(t)  (1.7) 

in  some  distributional  sense.  Again,  the  rv  L0(t)  can  be  evaluated  independently 
of  N.  We  can  then  investigate  the  process  {q(t),  L0(t),  t  —  0, 1, . . .}  to  derive 
both  qualitative  and  quantitative  properties  concerning  of  the  interaction  between 
TCP  flows  and  AQM  mechanisms.  The  main  theme  of  this  thesis  therefore 
revolves  around  establishing  convergence  results  similar  to  (1.4)  and  (1.6),  and 
investigating  the  properties  of  the  limiting  recursions. 

1.3  Contributions 

In  this  section,  we  highlight  the  main  contributions  of  this  thesis.  The  results  are 
derived  from  various  models  with  common  characteristics  described  in  the 
previous  section.  Indeed,  the  recursion  of  the  RED  buffer  follows  (1.2)  with 
A^N\t)  and  C^N\t)  left  to  be  specified  by  the  models.  In  all  of  the  models, 
C(N\t)  represents  the  service  capacity  (in  packets)  of  the  bottleneck  RED 
gateway  which  satisfies  the  scaling  assumption  outlined  earlier, 
i.e.,  C^N\t)  =  NC,  t  =  0, 1, . . .,  for  some  positive  constant  C  where  C  denote 
the  average  capacity  per  flow.  Therefore,  the  main  differences  between  each 
model  are  in  specifying  the  packet  arrivals  of  the  TCP  sources  in  each  timeslot. 

In  a  distributed  congestion-control  scheme  such  as  TCP,  each  source  has  to 
adjust  its  transmission  rate  by  relying  only  on  the  local  information  at  the 
source.  This  local  information  can  be  the  state  variables  of  the  TCP 
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congestion-control  mechanism  and  the  past  events  (such  as  packet  losses)  that 
the  TCP  source  has  experienced.  If  A^N\t)  denotes  the  number  of  packets  TCP 
source  i  =  1, . .  . ,  N  injects  into  the  bottleneck  gateway  in  timeslot  [t,  t  +  1),  then 
the  aggregate  number  of  packets  coming  into  the  RED  buffer  in  timeslot  [t,  t  +  1), 
A^N\t)  is  given  by 

N 

=  (i.8) 

1=1 

Therefore,  Q^N\t)  +  A^N\t)  packets  are  available  for  transmission  during  that 
timeslot.  Since  the  outgoing  link  operates  at  the  rate  of  NC  packets/timeslot, 
[Q^it)  +  A(N\t)  —  NC]  +  packets  remains  during  timeslot  [t,t  +  1]  in  the 
buffer,  their  transmission  being  deferred  to  subsequent  timeslots.  The  number 
Q(N\t  +  1)  of  packets  in  the  buffer  at  the  beginning  of  timeslot  [t  +  1,  t  +  2)  is 
therefore  given  by 

Q{n) (t  +  1)  =  [Q(a° (t)  -  NC  +  AiN) (f)] +  .  (1.9) 

The  evolution  of  A[N\t)  depends  on  Q^N\t)  through  some  signaling 
mechanisms,  e.g.,  random  marking/dropping  of  packets  with  probability  being  a 
function  of  Q(N\t).  In  all  the  models,  we  assume  the  structural  condition  (1.1) 
on  how  the  marking/dropping  probability  function  scales  with  N. 

Under  the  aforementioned  framework,  the  main  theoretical  contributions  to 
be  reported  here  are: 

The  Weak  Laws  of  Large  Numbers  (WLLN) 

Under  different  models,  Theorems  1,  2,  and  4  show  that  the  dynamics  of  the 
RED  buffer  at  time  t,  denoted  as  Q(N\t),  can  be  approximated  by  Nq(t )  with 
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q{t)  determined  via  a  simple  deterministic  recursion,  which  is  independent  of  the 
number  of  users.  The  limiting  model  is  therefore  “scalable”  as  it  does  not  suffer 
from  the  state  space  explosion,  nor  does  it  require  any  ad-hoc  assumptions. 
Indeed,  we  have  the  convergence  (1.4),  so  that  the  approximation  becomes  more 
accurate  as  the  number  of  flows  becomes  large.  We  also  note  that  (1.4)  is  a 
byproduct  of  the  convergence 

sr^N  a(n)(+\ 

a(t)  =  E  [A(t)] ,  (1.10) 

for  some  constant  a(t)  and  rv  A(t)  determined  by  the  model.  The  convergence 
(1.10)  can  be  interpreted  as  a  Weak  Law  of  Large  Numbers  for  the  array 
{^(f),  N  =  1,2,...;  i  =  1, . . . ,  IV}. 

Moreover,  the  dependency  between  each  TCP  flow  with  RED  becomes 
negligible  under  a  large  number  of  flows.  This  finding  is  compatible  with  the 
belief  that  RED  breaks  the  global  synchronization  between  TCP  flows  [18]. 

These  Weak  Laws  of  Large  Numbers  also  provide  some  justification  for 
evaluating  the  dynamics  of  TCP/RED  (and  more  generally  TCP/AQM)  through 
deterministic  models  as  surveyed  in  Chapter  2.  The  Weak  Laws  of  Large 
Numbers  suggest  that  the  recursion  of  the  average  queue  q(t)  depends  only  on 
the  average  workload  input  to  the  network  a(t)  =  E  [d(f)]  and  the  distributional 
recursion  of  A(t)  is  closely  related  to  the  recursion  of  a  single  flow. 

Steady-state  analyses 

For  N  persistent  TCP  flows  when  the  bottleneck  capacity  is  NC,  in  non-trivial 
situations,  the  average  (steady-state)  throughput  of  each  flow  is  seen  to  be 
approximately  C  for  large  N,  so  that  TCP  traffic  does  not  realize  the  benefits 
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commonly  associated  with  statistical  multiplexing.  Indeed,  for  N  open-loop 
independent  traffic  sources,  each  operating  at  average  rate  C,  the  bandwidth 
required  to  serve  the  aggregate  flow  is  typically  less  than  NC  under  statistical 
multiplexing.  TCP  flows,  on  the  other  hand,  are  correlated  due  to  their  coupling 
via  the  bottleneck  router,  and  multiplexing  TCP  flows  is  not  as  effective. 

To  make  use  of  the  limiting  model  in  network  dimensioning,  the  queue  length 
at  the  steady-state  can  be  easily  calculated,  and  the  steady-state  distribution  of 
the  window  size  can  be  evaluated  from  well-known  TCP  models  [47,  48].  For 
TCP  flows  with  session  dynamics  and  variable  round-trip  delays,  we  propose 
estimating  the  steady-state  limiting  queue  size  by  a  simple  approximation  under 
mild  assumptions.  It  is  noteworthy  that  the  average  buffer  utilization  and 
average  window  size  at  the  steady-state  depend  on  the  RTT  and  on  the  file  size 
distributions  only  through  their  mean  values. 

The  Central  Limit  Theorems  (CLT) 

For  a  more  accurate  description  of  the  queue  dynamics,  Theorems  3  and  5 
present  a  Central  Limit  analysis  validating  the  convergence  (1.6)  for  some  rv 
L0(t),  provided  the  marking/dropping  probability  function  is  continuously 
differentiable.  The  rvs  {L0(t),  t  —  0, 1, . . .}  are  Gaussian  except  in  a  special 
technical  case  and  a  distributional  recursion  is  also  provided  for  generating  them. 

The  Central  Limit  analysis  leading  to  (1.6)  reveals  that  the  magnitude  of  the 
queue  fluctuation  is  proportional  to  the  derivative  of  /  around  the  limiting 
(normalized)  queue  size  q(t).  This  advocates  the  use  of  a  smooth  marking 
probability  function  as  suggested  in  the  RED  “gentle”  option  as  opposed  to  the 
original  recommendation  in  [16]  and  [18].  This  finding  coincides  with  the 
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observed  oscillatory  behavior  of  RED  when  the  average  packet  drop  rate  exceeds 
maxjp ,  in  the  absence  of  RED’s  “gentle”  modification  [14]. 

A  closer  inspection  at  the  CLT  results  reveals  the  sources  of  queue 
fluctuations  can  be  decomposed  into 

(a)  The  fluctuation  in  the  feedback  information,  i.e.,  the  difference  between  the 
limiting  feedback  probability  f(q(t ))  and  the  actual  feedback  probability 

/W(QW(t)). 

(b)  The  binary  nature  of  the  feedback  information,  i.e.,  the  sources  have  to  ap¬ 
proximate  the  feedback  probability  f^(Q^N\t))  from  whether  the  transmit¬ 
ted  packets  received  any  marks. 

(c)  The  variation  in  the  RTTs  and  hie  size  variations. 

The  limiting  rvs  representing  (b)-(c)  are  Gaussian  and  their  combined  effects  on 
the  queue  fluctuations  are  determined  through  the  protocol  structure. 

1.4  Thesis  Organization 

This  thesis  is  organized  as  follows:  In  Chapter  2,  we  begin  with  a  literature 
review  that  relates  to  our  work.  In  Chapter  3,  we  describe  a  simple  ersatz  model 
of  homogeneous,  persistent  rate-controlled  hows  which  react  to  the  events  in  the 
network,  i.e.,  congestion/no  congestion.  This  ersatz  model  illustrates  the  type  of 
results  than  can  be  expected  in  more  complex  situations.  Chapter  4  presents  a 
more  realistic  model  of  persistent  TCP  hows  which  incorporates  the  explicit 
AIMD  window  mechanism  of  TCP  reacting  to  the  marking  mechanism  in  the 
RED  bottleneck  gateway.  The  Weak  Law  of  Large  Numbers  of  the  model  is 
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presented  in  Chapter  4  along  with  its  Central  Limit  analysis.  In  Chapter  5,  the 
model  is  then  extended  to  incorporate  session-layer  dynamics  and  variable 
round-trip  delays  —  the  Weak  Law  of  Large  Numbers  and  a  Central  Limit 
analysis  are  established  for  this  model.  The  proofs  of  the  Weak  Laws  of  Large 
Numbers  are  given  in  Chapter  6  while  the  proof  of  the  Central  Limit  theorem  are 
given  in  Chapter  7.  Finally  in  Chapter  8,  we  summarize  the  work  that  has  been 
done  in  the  dissertation  and  discuss  future  research  directions. 


1.5  Notation 

Some  words  on  the  notation  in  use:  Equivalence  in  law  or  in  distribution  between 
random  variables  (rvs)  is  denoted  by  =st.  The  indicator  function  of  an  event  A  is 
given  by  1  [A],  and  we  use  —>n  (resp.  =^n)  to  denote  convergence  in  probability 
(resp.  weak  convergence  or  convergence  in  distribution)  with  n  going  to  infinity. 
For  scalars  a  and  b  we  write  a  V  b  =  max(a,  b)  and  a  A  b  =  min(a,  b ).  For  a  fixed 
positive  integer  N,  let  J\f  =  (1,  •  •  •  ,  N}  denote  the  set  of  sessions  and  let 
indicate  the  explicit  dependence  of  the  quantity  X  on  the  number  N.  The 
expectation  of  a  rv  X  with  a  distribution  function  F  is  given  by  either  E  [A"]  or 
E  \F}.  For  simplicity,  we  introduce  the  notation  lx  [a;]  and  Px  [#]  for  1  [. X  =  x] 
and  P  [X  =  x\,  respectively. 
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Chapter  2 


Literature  Survey 


This  chapter  discusses  both  previous  research  and  the  current  state-of-the-art 
relating  to  our  work.  The  chapter  is  organized  as  follows.  First  we  present  an 
overview  on  several  congestion-control  mechanisms  and  active  queue  management 
(AQM)  algorithms  in  Section  2.1.  Next,  we  survey  the  work  on  the  modeling  of 
these  congestion-control  mechanisms  and  their  interaction  with  AQM 
mechanisms.  We  categorize  different  modeling  approaches  into  two  classes.  The 
first  class  consists  of  models  derived  without  the  assistance  of  limit  theorems. 
Models  in  this  class  will  be  referred  later  as  micro-scale  models  and  are  presented 
in  Section  2.2.  The  second  class  consists  of  models  derived  via  limit  theorems. 

We  later  refer  to  models  in  this  class  as  macro-scale  models  and  present  them  in 
Section  2.3.  Models  which  will  be  developed  in  this  thesis  also  belong  in  this 
class. 


2.1  Overview  of  Congestion-Control  and  AQM  Mechanisms 

Two  major  classes  of  congestion-control  mechanisms  are  surveyed:  (i)  TCP 
congestion-control  mechanism  and  its  variants,  and  (ii)  Optimization-based 
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congestion-control  mechanisms.  For  AQM  mechanisms,  we  first  present  the 
earliest  and  most  widely-implemented  algorithm,  Random  Early  Detection 
(RED).  Other  popular  AQM  mechanisms  are  briefly  described  later. 

The  Internet  and  TCP  congestion-control 

The  Internet  was  developed  by  the  United  States  Department  of  Defense  prior  to 
and  concurrent  with  ISO  standard  activities.  The  resulting  internetwork,  known 
as  ARPANET,  has  now  been  extended  to  incorporate  internetworks  developed  by 
other  government  agencies,  academic  institutions  and  commercial  networks  .  The 
combined  internetworks  are  now  known  simply  as  the  Internet  [20]. 

The  protocol  suite  used  within  the  Internet  is  known  as  the  Transmission 
Control  Protocol/Internet  Protocol  (TCP/IP)  suite.  Since  all  the  protocol 
specifications  associated  with  TCP/IP  are  available  in  the  public  domain,  they 
have  been  used  extensively  by  commercial  and  public  organizations  for  creating 
open  system  networking  environments. 

The  Internet  Protocol  (IP)  main  functions  are  equivalent  to  those  of  the  OSI 
networking  layer.  These  functions  include  fragmentation  and  reassembly,  routing, 
and  error  reporting.  All  are  necessary  for  internetworking  across  dissimilar 
networks  [20]. 

The  Transmission  Control  Protocol  (TCP)  is  a  connection-oriented  transport 
protocol.  Its  service  offered  to  the  users  is  known  as  “reliable  stream  transport 
service”.  Its  functions  cover  (but  are  not  equivalent  to)  those  of  the  OSI 
Transport  layer  and  Session  layer.  Besides  establishing  and  terminating 
connections,  TCP  is  also  needed  for  end-to-end  control  over  the  “best-effort” 
service  of  IP.  The  role  of  TCP,  which  is  hidden  from  users,  is  to  adapt  the 
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sending  rate  of  the  source  to  the  capacity  of  the  network.  Without  such  a 
congestion- control  mechanism  in  TCP,  traffic  sources  could  saturate  one  or 
several  routers,  thereby  causing  many  losses  and  retransmissions  and  resulting  in 
congestion  collapse  [26]. 

At  the  core  of  TCP  is  a  dynamic  window-based  congestion-control  scheme 
introduced  by  Jacobson  [26].  The  window  mechanism  limits  the  number  of 
packets  sent  by  the  source  and  not  yet  acknowledged  by  the  destination.  This 
limit  on  the  number  of  unacknowledged  packets  is  referred  as  the  congestion 
window  size.  The  efficiency  of  the  congestion-control  mechanism  depends  greatly 
on  the  choice  of  the  window  size.  If  the  window  size  is  too  large,  the  source  could 
saturate  the  routers.  If  the  window  size  is  too  small,  the  link  may  not  be  fully 
utilized.  Due  to  the  complex  nature  of  the  Internet,  the  optimal  value  of  the 
congestion  window  size  is  not  known  a  priori  because  it  depends  on  dynamic 
parameters  of  the  network  such  as  the  number  of  connections  sharing  the  same 
links.  Moreover,  the  optimal  window  size  can  also  change  during  the  duration  of 
a  connection  because  of  the  time-varying  nature  of  the  Internet.  As  a  result,  the 
window  size  of  TCP  adapts  according  to  the  network  conditions. 

There  are  two  working  phases  in  the  TCP  congestion-control  mechanism.  The 
first  phase  is  called  the  slow  start  phase  where  the  window  size  is  increased  by 
one  packet  for  every  acknowledgment  received.  Therefore,  the  congestion  window 
size  doubles  approximately  every  round-trip  time1,  resulting  in  an  exponential 
increase  of  the  window  size  as  a  function  of  time.  The  second  phase  is  called  the 
congestion  avoidance  phase,  during  which  the  window  size  increases  linearly  as  a 

1  Round-trip  time  (RTT)  is  the  duration  of  time  since  the  beginning  of  the  transmission  of  a 
packet  to  the  destination  until  an  acknowledgment  from  the  recipient  is  correctly  received. 
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function  of  time.  TCP  initially  operates  in  the  slow  start  phase  trying  to  quickly 
gauge  the  available  bandwidth.  When  a  packet  loss  is  detected,  TCP  cuts  back 
its  window  size  trying  to  resolve  the  congestion  in  network,  and  then  switches  to 
the  congestion  avoidance  mode.  The  actual  implementation  of  these  mechanisms 
varies  with  each  version  of  TCP. 

The  most  popular  version  of  TCP,  called  Reno,  employs  an  additive  increase, 
multiplicative  decrease  (AIMD)  algorithm  [26].  In  this  algorithm,  the  TCP 
congestion  window  size  will  increase  by  approximately  one  packet  per  round-trip 
in  the  congestion  avoidance  phase.  When  congestion  is  detected  by  packet  loss, 
the  window  size  is  reduced  by  half.  As  a  result,  the  TCP  Reno  congestion 
window  oscillates  between  periods  of  under-utilization  and  over-utilization  of 
network  resources.  Under  certain  conditions,  it  can  be  shown  that  TCP  Reno 
flows  with  the  same  round-trip  delay  competing  for  the  bandwidth  in  a  single 
bottleneck  have  equal  long-term  average  throughput  [7]. 

In  order  to  avoid  such  an  oscillatory  behavior,  a  new  version  of  TCP,  called 
Vegas,  was  proposed  in  [9].  The  main  idea  is  to  use  the  delay  information  of  the 
packets  instead  of  packet  losses  in  order  to  adjust  the  congestion  window  size. 
TCP  Vegas  can  stabilize  the  window  size  close  to  the  optimal  value  without 
introducing  periodic  oscillations.  However,  the  incompatibility  between  TCP 
Reno  and  Vegas  is  well-documented  [44]  and  presents  a  major  obstacle  in  the 
deployment  of  TCP  Vegas.  Therefore,  it  is  reasonable  to  expect  that  TCP  Reno 
will  remain  the  dominant  congestion-control  algorithm  in  the  Internet  for  the 
foreseeable  future. 
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Optimization-based  congestion-control 


One  approach  which  has  received  much  attention  recently  is  to  model  TCP 
throughput  as  the  solution  to  a  utility  maximization  problem.  The  interest  in 
this  model  originates  from  the  work  of  Kelly  [31].  It  shows  that  the  utility 
maximization  problem  of  the  system  composed  of  a  network  and  of  users  can  be 
decomposed  into  two  separate  problems,  namely  the  network  problem  and  the 
users  problem,  assuming  the  utility  function  of  each  user  is  a  concave  function  of 
the  user’s  received  throughput.  If  the  network  maximizes  its  revenue  and  then 
users  subsequently  maximize  their  utilities,  the  recursive  maximization  sequences 
will  converge  to  the  solution  of  the  system  utility  maximization  problem. 
Subsequent  work  by  Kelly  et  al.  [30]  shows  that  the  user  problem  can  be  solved 
by  a  rate  control  algorithm  which  can  be  implemented  as  a  congestion-control 
algorithm.  The  rate  control  algorithm  depends  on  the  form  of  the  utility  function 
of  the  user.  Recent  work  by  Low  [36]  shows  that  the  throughput  of  the  AIMD 
TCP  congestion-control  algorithm  solves  the  user  problem  with  a  specific  utility 
function.  Furthermore,  AQM  mechanisms  such  as  RED  can  be  modeled  as  the 
feedback  functions  from  the  network  to  the  users.  This  enables  efficient  modeling 
of  a  large  network  with  multiple  users  and  makes  it  possible  to  predict  the 
interactions  between  the  network  and  traffic  flows  through  the  choices  of  utility 
functions  and  network  parameters. 


Random  Early  Detection  (RED)  gateway 

Random  Early  Detection  (RED)  is  one  of  the  earliest  AQM  mechanisms,  and  is 
currently  the  most  widely-implemented.  It  was  proposed  by  Floyd  and  Jacobson 
[18]  to  keep  the  queue  size  small,  reduce  burstiness  and  solve  the  problem  of 
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global  synchronization  in  the  Internet,  i.e.,  when  several  TCP  flows  compete  for 
bandwidth  in  a  tail-drop  gateway,  packets  from  many  flows  are  usually  discarded 
(near)  simultaneously  [63] ,  resulting  in  a  poor  utilization  of  the  network. 

To  accomplish  these  goals,  RED  signals  incoming  flows  of  incipient  congestion 
by  randomly  dropping  incoming  packets  with  a  probability  depending  on  the 
exponentially  averaged  queue  length.  As  a  result,  flows  are  notified  and  have 
enough  time  to  react  to  growing  congestion  before  the  actual  queue  is  full.  RED 
is  designed  to  work  in  conjunction  with  congestion-controlled  TCP  flows.  RED 
also  incorporates  a  marking  mechanism  to  signal  flows  of  congestion  through  the 
Explicit  Congestion  Notification  [15]  option  in  the  IP  packet  header. 

The  introduction  of  RED  initiated  a  whole  new  research  area  in  active  queue 
management.  There  are  now  many  proposed  variations  of  RED  and  other  similar 
AQM  schemes,  some  of  which  will  be  surveyed  in  the  next  subsection.  However, 
the  interaction  between  AQM  schemes  and  Internet  traffic  even  for  a  simple 
AQM  scheme  such  as  RED  is  still  not  well  understood  as  there  exist  several 
conflicting  reports.  For  example,  while  it  is  claimed  that  RED  can  avoid  global 
synchronization  in  [18],  it  is  reported  in  [8]  that  the  number  of  consecutive  packet 
drops  in  RED  is  higher  than  in  tail-drop  gateway,  and  therefore  that  RED  does 
not  help  alleviate  the  global  synchronization  problem. 

Other  active  queue  management  mechanisms 

This  section  provides  a  brief  overview  on  some  other  interesting  AQM 
mechanisms.  Although  all  of  these  schemes  mark  or  drop  packets  to  signal  flows 
of  incipient  congestion  similarly  to  RED,  they  have  more  complex  mechanisms  to 
determine  the  feedback  probability. 
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The  PI  controller  [24]  is  proposed  as  a  follow-up  to  the  work  in  [22]  that 
models  the  TCP/AQM  interaction  in  the  framework  of  a  linearized  feedback 
control  system.  The  proposed  PI  controller  improves  the  transient  behavior  and 
forces  the  queue  to  achieve  the  target  buffer  utilization  in  the  steady-state. 

BLUE  [13]  is  an  active  queue  management  mechanism  which  uses  packet 
losses  and  link  idle  events  to  adjust  the  packet  dropping  probability  instead  of 
the  average  queue  size  in  RED.  By  decoupling  the  queue  length  from  congestion 
management,  the  authors  claim  that  BLLIE  can  reduce  the  packet  loss  rate  in  the 
queue  and  requires  smaller  queue  in  order  to  achieve  the  stable  operating  regime. 

Stabilized  RED  [46]  uses  a  packet  sampling  technique  to  estimate  the  number 
of  flows  utilizing  the  bottleneck  link.  The  appropriate  dropping  probability  in  the 
bottleneck  link  is  then  computed  on  the  basis  of  this  estimate,  since  the  number 
of  flows  is  an  indicator  of  the  fraction  of  the  queue  which  would  be  reduced  by 
triggering  the  multiplicative  decrease  mechanism  in  TCP  congestion-control. 

A  modification  to  RED  called  Adaptive  RED  is  proposed  in  [17].  In  this 
scheme  a  target  queue  utilization  is  selected  and  the  marking/ dropping 
probability  is  adjusted  by  an  additive  increase/multiplicative  decrease  mechanism 
on  every  predefined  interval.  If  the  queue  is  smaller  than  the  target  queue,  the 
marking  probability  is  decreased  multiplicatively,  otherwise  it  is  increased  by 
some  small  constant.  It  is  shown  in  [35]  that  Adaptive  RED  is  insensitive  to  the 
network  load  and  induces  smaller  fluctuation  than  RED. 

Kelly  et  al.  introduces  a  virtual  queue  mechanism  as  an  alternative  to 
congestion  management  [31].  The  virtual  queue  mechanism  maintains  a  fictitious 
queue  with  a  smaller  capacity  and  buffer  size  than  the  actual  queue.  For  every 
packet  arrival  or  departure  in  the  actual  queue,  the  virtual  queue  is  updated. 
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Whenever  the  virtual  queue  overflows,  any  packet  arrival  in  the  actual  queue  is 
marked/dropped  to  signal  the  sources  of  incipient  congestion.  The  virtual  queue 
mechanism  requires  smaller  queue  to  operate  than  RED  but  cannot  remedy  the 
flow  synchronization  problem. 

A  problem  with  the  virtual  queue  mechanism  is  that  the  appropriate  capacity 
of  the  virtual  queue  depends  on  both  the  load  and  the  round-trip  delay  of  the 
flows.  Kunniyur  and  Srikant  suggest  an  improved  virtual  queue  mechanism  called 
Adaptive  Virtual  Queue  [34],  in  which  the  capacity  of  the  virtual  queue  is 
adjusted  through  a  slower  control  loop.  It  is  claimed  that  the  Adaptive  Virtual 
Queue  mechanism  yields  very  high  utilization,  very  small  packet  loss  and 
queueing  delay  for  a  wide-range  of  traffic  load. 

A  very  desirable  property  for  AQM  mechanisms  is  to  penalize  or  regulate 
aggressive/ unresponsive  flows.  CHOKe  [51]  is  a  very  simple  mechanism  operating 
on  top  of  the  original  RED  algorithm.  It  can  be  shown  that  the  simple,  stateless 
CHOKe  algorithm  can  guarantee  a  bound  on  the  throughput  of  unresponsive 
flows. 

It  is  easy  to  imagine  that  the  additional  complexities  in  these  AQM 
mechanisms  further  complicate  the  analysis  of  the  TCP/AQM  interaction.  A 
systematic  evaluation  of  different  AQM  mechanisms  is  needed  to  verify  and 
compare  the  claims  in  each  study.  It  is  our  hope  that  the  framework  proposed  in 
this  thesis  can  be  generalized  to  encompass  these  AQM  mechanisms. 
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2.2  Micro-scale  Modeling  of  Congestion-Controlled  Flows 


In  this  section,  we  review  some  of  the  existing  work  on  micro-scale  traffic 
modeling  of  a  congestion-controlled  traffic  flow  as  well  as  the  efforts  made  to 
extend  these  micro-scale  models  to  the  situation  with  many  interacting  flows. 

TCP  modeling  by  Markov  processes 

One  of  the  earliest  and  most  popular  efforts  to  model  TCP  made  use  of  Markov 
chain  modeling.  The  size  of  the  TCP  congestion  window  acts  as  the  state  of  the 
chain  and  the  loss  probability  (either  independent  or  dependent  on  the  state) 
determines  the  transition  probabilities.  Padhye  et  al.  have  derived  an 
approximation  for  the  steady-state  distribution  of  a  Markov  chain  modeling  a 
single  TCP  connection  with  a  fixed  loss  probability  [48].  Altman  et  al.  [2] 
extended  the  model  to  cover  more  general  loss  processes,  e.g.,  continuous-time 
Markov  chain  with  different  loss  probability  in  each  state. 

Extension  of  micro-scale  TCP  model  to  many  TCP  flows 

In  this  section,  we  point  to  various  efforts  for  extending  the  micro-scale  TCP 
model  to  many  TCP  flows.  In  the  models  to  be  described,  there  is  no  attempt  to 
consider  a  limiting  model. 

The  resulting  models  suffer  from  either  one  of  the  following  difficulties:  (i) 
These  models  are  typically  too  complex  and  therefore  not  tractable  analytically. 
Sometimes  this  can  also  translate  in  numerical  calculations  being 
computationally  prohibitive;  (ii)  Reduction  in  complexity  can  be  achieved  by 
making  simplifying  assumptions  which  are  often  ad-hoc,  difficult  to  justify,  and 
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sometimes  downright  unrealistic.  We  argue  that  these  assumptions  should  emerge 
in  a  natural  way  from  the  traffic  modeling  process,  rather  than  being  enforced  for 
the  main  reason  that  they  enable  the  analysis  to  go  through.  It  will  soon  become 
apparent  that  a  modeling  methodology  based  on  limit  theorems  provides  a  direct 
remedy  to  this  issue  -  Indeed,  model  simplifications  occur  in  the  limit,  without 
the  need  for  any  ad-hoc  assumptions. 

Hasegawa  et  al.  [21]  consider  a  Markov  chain  for  N  TCP  connections  using 
either  Tail-Drop  or  RED  gateway.  The  state  space  in  this  system  is  the  vector  of 
window  size  of  all  of  N  connections  and  the  queue  size,  which  is  a  function  of  the 
sum  of  the  size  of  congestion  windows.  For  each  connection,  the  transition 
probability  depends  on  the  current  window  size  and  queue  size,  hence  on  the 
window  size  of  all  connections.  The  authors  then  derive  a  fixed-point  solution  of 
the  average  window  size  in  steady-state.  However,  this  fixed-point  solution  is 
very  complicated  and  requires  solving  a  large  system  of  non-linear  equations.  It 
is  not  clear  how  this  could  be  accomplished  effectively  and  whether  it  would  offer 
any  analytical  understanding  to  the  problem. 

Another  example  is  the  work  by  Garetto  et  al.  [19]  where  a  closed-queueing 
network  model  is  proposed  to  investigate  the  interaction  between  many  TCP 
flows.  A  closed  network  of  M\M\oo  queues  is  introduced  where  each  queue 
represents  a  state  of  the  TCP  algorithm.  The  number  of  users  in  each  queue 
represents  the  number  of  TCP  connections  in  that  state.  The  service  rate  of  the 
user  in  each  queue  depends  on  the  TCP  state.  After  completing  service  in  a 
queue,  each  user’s  transition  to  the  next  queue  depends  on  the  loss  probability. 
This  closed- queueing  network  is  used  in  a  two-tier  model  which  can  be  described 
as  follows:  Given  a  loss  probability,  a  numerical  calculation  of  the  steady-state 
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distribution  of  the  closed-queueing  network  yields  an  approximated  traffic  load. 
This  load  will  be  input  into  a  pseudo- network  model  (e.g.,  M\M\1\B)  to 
determine  the  new  loss  probability  for  the  closed-queueing  network.  If  the  result 
from  the  successive  iterations  converges,  it  is  claimed  that  the  convergence  will 
be  to  the  same  working  point  as  given  by  the  model  where  each  TCP  flow  is 
modeled  in  details. 

Several  questions  arise  concerning  such  a  model.  First,  the  model  implicitly 
assumes  that  the  dynamics  of  the  TCP  congestion-control  mechanism  converge  to 
steady  state  faster  than  the  network  dynamics;  this  is  the  opposite  of  the  real 
Internet  where  TCP  reacts  to  events  in  the  network.  Next,  while  the  numerical 
calculations  are  not  as  complex  as  would  be  the  case  in  a  detailed  model  of  TCP 
flows,  they  are  far  from  simple.  The  example  considered  in  [19]  consists  of  11 
different  queue  types  and  the  number  of  queues  (Mq)  is  357.  Calculations  for  the 
steady-state  distribution  of  the  queue  are  typically  of  the  order  O(M^)  for  each 
iteration.  Finally,  the  “interaction”  between  TCP  flows  is  developed  in  an 
abstract  model  where  its  deviation  from  the  actual  interaction  process  cannot  be 
quantified.  It  also  impossible  to  draw  any  analytical  conclusion  concerning  these 
interactions. 

Baccelli  et  al.  [4]  propose  fixed-point  methods  for  the  simulation  of  the 
sharing  of  a  local  loop  by  a  large  number  of  interacting  homogeneous  TCP 
connections.  The  analysis  uses  a  detailed  description  of  one  TCP  connection  and 
a  simplified  description  of  the  interaction  with  other  connections.  It  is  again 
difficult  to  quantify  the  effect  of  such  simplifications  and  the  accuracy  of  the 
results  can  only  be  verified  by  extensive  simulation. 
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Fluid  approximation  of  TCP  congestion-control  mechanism 


Mathis  et  al.  [40]  use  a  continuous-time  fluid  flow  approximation  to  the  discrete 
time  process  describing  window  behavior.  Assuming  the  congestion  signals  TCP 
to  back  off  according  to  a  Poisson  process,  where  the  kth  signal  occurs  at  time 
epoch  rk  (k  —  1,  2, . . .),  then  the  approximate  evolution  of  the  congestion  window 
evolution  of  a  single  TCP  flow  is  governed  by 


dW(t)  _  1 

dt  ~  w(ty 

except  at  the  points  rk  (k  —  1,2,.. .)  where 


(2.1) 


W(t+)  =  W(  r*-)/2. 


(2.2) 


These  equations  can  be  used  to  derive  the  fact  that  that  the  average  window  size 
is  of  the  order  of  1  /  y/p.  This  model  is  suitable  only  for  a  single  flow  because  the 
congestion  notification  is  assumed  to  be  Poisson  and  each  notification  is 
independent  of  each  other  (an  assumption  similar  to  [43]  which  uses  stochastic 
differential  equation  to  model  TCP).  When  more  than  one  flow  utilizes  the  same 
bottleneck  link,  this  assumption  is  not  helpful  for  capturing  the  interaction 
between  flows.  Bonald  [7]  considers  a  similar  model  for  several  TCP  flows  with 
the  major  difference  that  congestion  occurs  when  the  sum  of  congestion  windows 
exceeds  the  bandwidth-delay  product  plus  the  bottleneck  buffer.  Under  the 
assumption  that  at  every  congestion  epoch  all  TCP  flows  simultaneously  back 
off,  TCP  can  be  shown  to  be  fair  and  an  explicit  closed-form  formula  for  the 
utilization  can  be  derived.  However,  the  assumption  of  total  synchronization 
between  flows  is  unrealistic. 

While  there  are  many  advantages  of  viewing  the  system  as  a  utility 
maximization  problem  and  of  modeling  TCP  traffic  as  rate-controlled  fluid  flows, 
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there  are  definite  drawbacks  as  well.  Most  important  of  all  is  the  absence  of 
“packets”  from  the  system  model,  although  the  congestion-control  mechanism  of 
TCP  relies  on  packet-level  operations.  For  example,  it  is  not  possible  under  this 
model  to  derive  the  queue  length  distribution  at  routers  which  is  an  important 
quantity  for  any  successful  network  dimensioning.  Additionally,  while  the 
solution  to  the  maximization  problem  might  accurately  describe  the  steady-state 
solution  of  TCP,  the  distributed  solution  does  not  necessarily  capture  the 
short-term  dynamics  of  TCP  well.  Finally,  the  numerical  calculation  of  the 
solution  is  still  very  complex  as  it  suffers  from  state  space  explosion  when  the 
number  of  TCP  flows  becomes  large.  This  type  of  models  appears  more  suitable 
for  understanding  the  “big  picture”  and  the  qualitative  behavior  of 
congestion-control  algorithms,  rather  than  for  effective  and  accurate  network 
dimensioning. 

Stability  of  the  congestion-control  and  AQM  mechanisms 

As  previously  mentioned,  the  interaction  between  the  congestion-control  and 
AQM  mechanisms  can  be  analyzed  as  feedback  control  systems.  Using  the 
optimization-based  framework,  Kelly  et  al.  have  shown  the  stability  of  the 
system  in  the  absence  of  delay  through  a  Lyapunov  function  argument  [30]. 
Subsequently,  a  lot  of  results  have  been  published  on  both  the  local  and  global 
stability  of  the  system  with  arbitrary  delay  either  in  the  context  of  the  classical 
TCP  congestion-control  algorithm  or  of  utility-based  congestion  controllers 
(e.g.,  [28,  38,  49,  54]). 

While  the  aforementioned  work  yields  great  insights  into  the  dynamics  of  the 
control  system  that  governs  the  congestion-control /AQM  interaction,  questions 
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remain.  First  of  all,  these  results  are  based  on  deterministic  models  of 
feedback-delay  systems.  However,  the  relationship  between  the  actual  systems 
(which  are  stochastic)  and  the  deterministic  models  are  not  apparent.  For 
example,  what  is  the  relationship  of  the  probabilistic  marking  mechanism  in 
AQM  implementation  to  the  shadow  price  in  the  utility-based  model? 
Furthermore,  the  stability  conditions  often  assume  a  small  number  of 
homogeneous  flows  in  order  to  obtain  analytically  tractable  results  due  to  the 
complex  dynamics  and  large  state-space  involved.  Sometimes,  the  stability 
conditions  are  stated  in  the  “worst-case”  conditions,  e.g.,  the  largest  delay,  which 
could  be  extremely  conservative. 

This  thesis  remedies  to  these  shortcomings  as  follows.  First,  it  provides  a 
justification  on  using  the  deterministic  models  to  analyze  the  stability  of  the 
systems.  More  specifically,  in  the  large  number  of  flows  regime,  the  dynamics  of 
the  average  queue  depends  only  on  the  dynamics  of  the  average  traffic  flows 
in/out  of  the  queue.  The  dynamics  of  the  average  traffic  are  also  closely  related 
to  the  dynamics  of  a  single  flow  in  the  case  of  homogeneous  flows,  lending 
support  to  the  practice  of  considering  the  simple  deterministic  models.  For 
systems  with  heterogeneous  round-trip  delays,  we  also  derive  a  simple  recursion 
for  the  average  queue  and  traffic  flow.  By  analogy  with  the  homogeneous 
round-trip  case,  we  expect  that  a  simple  deterministic  model  should  also  exist  for 
the  analysis  of  the  stability  of  the  system.  This  would  relax  the  stability 
conditions  as  we  do  not  have  to  rely  on  the  worst  case  conditions. 


Modeling  of  TCP  traffic  with  session-layer  dynamics 

Existing  literature  on  short-lived  TCP  traffic  modeling  usually  relies  on  ad-hoc 
assumptions,  which  ensure  the  model  accuracy  only  in  certain  regimes.  Hollot  et 
al.  model  short-lived  TCP  flows  as  exponential  pulses,  i.e.,  time-shifted, 
increasing  exponential  functions  of  limited  durations,  whose  interarrival  times  are 
exponentially  distributed,  i.e.,  Poisson  process  [23].  The  statistics  of  these 
exponential  pulses  can  be  characterized  through  a  time-reversal  of  a  well-known 
class  of  processes  called  shot  noise  processes.  This  model  assumes  that  the 
short-lived  flows  last  only  a  few  round-trip  times  and  do  not  experience  packet 
drops  or  marks,  thereby  implicitly  assuming  that  congestion  level  is  relatively 
low.  Furthermore,  flows  are  always  in  either  congestion  avoidance  (long-lived 
connections)  or  slow  start  (short-lived  connections),  and  do  not  transition  from 
one  to  the  other.  In  other  words,  the  session  dynamics,  where  connections  arrive 
and  leave  the  network  after  transfers  are  completed,  are  not  explicitly  modeled. 

A  similar  approach  to  modeling  short-lived  flows  is  also  taken  in  [42], 

On  the  other  end  of  the  spectrum,  Kherani  and  Kumar  suggest  that  as  the 
bottleneck  capacity  becomes  very  small,  the  queueing  model  for  the  bottleneck 
queue  can  be  accurately  described  as  a  processor  sharing  queue  [32],  When  the 
capacity  is  large,  however,  the  processor  sharing  model  becomes  less  accurate 
because  newly  arrived  TCP  flows  cannot  fully  utilize  their  allocated  bandwidth. 
In  fact,  in  the  large  capacity  regime  these  short-lived  flows  may  terminate  even 
before  they  can  increase  their  transmission  rates  to  fully  utilize  their  allocated 
bandwidth  clue  to  slow  start. 

In  Chapter  5,  we  analyze  a  model  which  explicitly  incorporates  the 
session-layer  dynamics  and  show  that  the  limiting  model  agrees  with  these 
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models  in  their  respective  regime,  i.e.,  when  the  capacity  is  either  very  large  or 
very  small. 

Modeling  of  TCP  traffic  with  heterogeneous  round-trip  delays 

Little  work  has  been  done  on  modeling  the  interaction  between  an  AQM 
mechanism  and  a  large  number  of  TCP  flows  with  diverse  delays.  Some  initial 
investigation  of  the  role  of  heterogeneous  RTTs  into  such  an  interaction  is 
presented  in  [38].  However,  the  study  is  limited  to  a  small  number  of  TCP  flows, 
e.g.,  less  than  a  hundred  flows.  Additional  modeling  difficulties  arise  from  the 
fact  that  the  feedback  information  (i.e.,  marks  on  packets)  from  the  AQM 
mechanism  to  TCP  flows  arrives  at  different  rate  depending  on  the  RTTs  of  the 
connections.  These  obstacles  create  a  considerable  difficulty  in  deriving  a  scalable 
model  that  can  capture  the  important  aspect  of  Internet  traffic  dynamics  and 
yield  insights  into  how  to  control  it. 

The  role  of  the  variable  round-trip  delays  to  the  interaction  between  TCP  and 
AQM  interaction  in  the  large  number  of  flows  regime  is  investigated  in  details  in 
Chapter  5. 

2.3  Macro-scale  Modeling  of  Congestion-Controlled  Flows 

In  this  section,  we  outline  the  type  of  results  that  can  be  expected  from  modeling 
TCP  with  a  large  number  of  homogeneous  TCP  flows  via  limit  theorems. 

All  of  the  existing  models  surveyed  in  this  section  ignore  the  packet-level 
operations  of  TCP,  and  consider  only  the  evolution  of  the  transmission  rate, 
while  models  in  this  dissertation  take  into  account  detailed  packet-level 
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operations.  It  is  important  to  make  this  distinction  because  the  actual  operations 
of  TCP,  of  RED  and  of  the  network  rely  on  the  packet-level  operations.  The 
omission  of  this  level  of  details  simplifies  the  analysis  but  at  the  expense  of 
additional  distortion  in  the  results.  For  example,  it  will  be  demonstrated  in 
Theorem  5  that  the  ECN  marking  in  packets  introduce  additional  fluctuation  in 
the  limiting  queue.  This  fluctuation  cannot  be  captured  through  the  models 
without  packet-level  operations. 

Throughout  this  section,  we  use  a  common  notation  x[N\t)  to  denote  the 
transmission  rate  of  flow  i  (i  =  1, . . . ,  N)  at  time  t. 

Shakkottai  and  Srikant  [55]  consider  a  discrete-time  model  of  N  homogeneous 
proportional-fair,  congestion-controlled  flows  (such  as  the  primal  algorithm  in 
[30]).  These  flows  utilize  a  bottleneck  router  with  either  rate-based  marking 
(such  as  Virtual  Queue  marking)  or  queue-based  marking  (such  as  RED),  i.e., 

xf\t  +  1)  =  +  A  -  f3xf\t  -  d)f{N)(x[N\t  -  d)  +  ef\t  -  d))J  , 

where  A  and  /3  are  positive  constants  which  determine  the  rate  at  which  a  flow 
increases  and  decreases  its  transmission  rate,  /  is  the  marking  function,  d  is  the 
round-trip  delay  between  the  flow  and  the  bottleneck  router,  and  e is  a 
“noise”  process,  representing  short-lived  and  uncontrolled  flows.  A  natural  way 
to  obtain  a  simplified  limiting  model  resorts  to  rescaling  the  length  of  timeslots 
to  be  inversely  proportional  to  the  number  of  flows  N.  Then,  as  N  gets  large,  the 
average  transmission  rate  is  expected  to  converge  almost  surely  to  a  deterministic 
quantity  (under  appropriate  assumptions  on  the  “noise”  process  and  the  function 
/)  .  This  quantity  can  be  described  by  a  functional  differential  equation,  thereby 
justifying  deterministic  fluid  approximations  of  the  algorithms  described  in 
Section  2.2.  However,  the  aforementioned  approach  does  not  apply  well  to  the 
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Internet  with  TCP  congestion-control  because  (i)  the  model  ignores  the 
complexity  of  the  window-based  implementation  and  considers  only  the 
transmission  rate  of  the  flow,  and  (ii)  the  congestion-control  algorithms 
considered  are  also  derived  from  the  solution  to  utility  maximization  problems 
(as  described  in  Section  2.1),  whence  continuous  in  their  fluid  limit  and  thus 
“nicer”  than  TCP  congestion-control  algorithm  which  may  experience  abrupt 
transmission  rate  changes.  It  is  still  an  open  technical  problem  on  whether  there 
exists  a  fluid  limit  for  AIMD  TCP  congestion-control. 

Hong  and  Lebedev  [25]  consider  a  model  where  the  throughput  of  each 
connection  evolves  at  time  epochs  when  congestion  occurs.  At  every  epoch,  each 
connection  draws  a  {0.5, 1}  random  number  representing  the  fraction  of 
throughput  that  the  connection  retains  after  congestion.  This  fraction  depends 
on  the  throughput  of  the  flow  just  before  congestion  occurred.  If  Tn  is  the  nth 
congestion  epoch  and  7 i,n(x)  is  a  {0.5,  l}-valued  rv  depending  on  the  rate  x,  then 


with  an  obvious  notation.  Let  NC  be  the  capacity  of  the  bottleneck  router,  and 
assume  that  the  residual  capacity  at  time  T+,  namely  NC  —  YliLi  xiN\Tn  ),  is 
divided  evenly  among  all  users.  Then,  2 

*!w)rc*,) = C\n) + c  -  i  to- 

Denote  x the  transmission  rate  of  connection  i  right  after  the  nth-e poch. 

2The  authors’  model  is  flawed  since  the  transmission  rate  in  the  model  can  be  negative. 
However,  the  extension  of  the  model  to  incorporate  a  restriction  on  the  rate  to  be  only  positive 
should  not  change  the  nature  of  the  result. 
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Under  certain  conditions  on  the  rv  7 jiTl,  the  following  limits  exist: 


^linr  jf  in  -^1  and  a.s., 


r(°°) 


=  lim 


iV— >00 


E 


rA0 


.  Moreover,  the  sequence 


where  sn  =  E 

{ri“\  n  —  0, 1, . . .}  is  stationary  and  ergodic,  and  can  be  characterized  by  the 
relation 


d+!  =„  t„+i  Ur’  +  c  -  e  [4»>] )  •  (4“)  +  c  -  e  [4”>] ) , 

where  7n+i(x)  is  a  rv  which  has  the  same  distribution  as  7i,n+i(x). 

This  result  suggests  the  existence  of  a  simpler  process  for  describing  the 
asymptotic  behavior  of  the  average  rate.  Although  the  model  unrealistically 
assumes  that  the  multiplicative  decrease  in  a  TCP  flow  depends  only  on  its  own 
transmission  rate  and  not  on  that  of  other  flows,  we  will  show  in  the  next  section 
that  similar  results  also  exist  in  more  elaborate  models  with  many  TCP  flows 
sharing  a  RED  gateway. 

Adjih  et  al.  [1]  consider  a  continuous-time  model  where  partial  differential 
equations  of  the  free  buffer  space  in  a  Tail-Drop  gateway  and  the  density  of  the 
window  size  distribution  are  specified.  Under  the  assumption  that  the  buffer  in 
the  bottleneck  router  scales  with  the  number  of  users,  some  asymptotic  results  on 
both  the  free  buffer  space  and  the  window  distribution  are  established  by  the  use 
of  mean-field  approximations. 

Baccelli  et  al.  [5]  consider  a  continuous-time  model  where  the  window 
evolution  for  each  user  is  described  by  a  stochastic  differential  equation.  For  each 
of  the  window,  the  rate  grows  linearly  and  inversely  proportional  to  the 
round-trip  delay.  However,  the  window  size  will  be  cut  in  half  at  a  random  time 
determined  by  a  modulated  Poisson  process.  The  modulated  Poisson  processes 
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for  all  flows  are  coupled  through  a  common  rate  function  which  depends  on  the 
queue  size.  As  the  number  of  flows  grow  large,  the  average  queue  size  and  the 
window  size  can  be  determined  through  a  deterministic  mean-field  system. 
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Chapter  3 


A  Rate-based  Model  for  TCP/RED 


In  this  chapter,  we  present  a  stochastic  model  that  captures  the  essential  features 
of  TCP  congestion-control,  i.e.,  the  gradual  increase  and  the  sudden  decrease  of 
transmission  rate,  combining  with  a  random  drop  algorithm  similar  to  RED.  We 
analyze  this  ersatz  model  as  the  number  of  competing  flows  becomes  large,  and 
show  under  very  mild  assumptions  that  the  stochastic  model  simplifies  to  a 
two-dimensional  deterministic  recursion  when  the  number  of  flows  grows  large. 
This  result  suggests  that  with  a  large  number  of  flows,  network  operators  might 
be  able  to  easily  estimate  the  aggregate  behavior  of  TCP  flows  and  to  dimension 
network  resources  accordingly. 

First,  the  ersatz  model  is  presented  in  Section  3.1.  The  model  is  organized 
around  the  behavior  of  the  RED  buffer  contents  observed  at  discrete  epochs. 
While  the  model  presented  in  this  chapter  is  very  simple,  it  provides  a  complete 
conceptual  picture  of  the  problem,  i.e.,  the  growing  size  of  the  state  space  as  the 
number  of  flows  increases.  In  Section  3.2,  we  present  the  limit  theorem  of  this 
simple  model  which  demonstrates  the  type  of  results  to  be  expected  in  a  more 
complicated  and  realistic  model,  i.e.,  the  simplification  and  the  limiting  recursion 
through  the  law  of  large  numbers.  Results  from  Monte-Carlo  simulations  of  the 
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model  demonstrating  the  asymptotic  behavior  are  shown  in  Section  3.3. 

3.1  The  Model 

Fix  iV  =  1,2,...,  and  consider  the  situation  where  N  flows  are  active.  For 
simplicity  time  is  assumed  slotted.  Let  Q(N\t)  denote  the  number  of  packets  in 
the  buffer  at  the  beginning  of  timeslot  [t,t  +  1).  In  that  timeslot,  source  i 
generates  A (t)  packets  according  to  a  mechanism  to  be  specified  shortly. 
Following  the  approach  outlined  in  Section  1.3,  we  have  the  recursion  of  the 
queue  Q(N\t)  as  (1.9)  with  A(N\t)  given  by  (1.8). 

In  order  to  fully  specify  the  model,  we  need  to  specify  the  joint  statistics  of 
the  rvs  {A[N\t),  i  —  1, . . . ,  N;  t  —  0, 1, . . .}.  This  will  be  done  in  details  later. 
Throughout,  let  :  M+  — >  [0, 1]  denote  the  dropping  probability  function  of  the 
RED  gateway.  Moreover,  we  find  it  convenient  to  use  the  collection  of  i.i.d. 

[0, 1] -uniform  rvs  {Uiit  +  1),  \ j{t  +1),  i  —  1, . . . ;  t  —  0, 1, . . .}  which  are  assumed 
independent  of  the  rvs  Q(AT0)  and  other  initial  conditions. 

In  this  model,  a  source  either  transmits  or  is  idle  in  a  given  timeslot.  So,  let 
B\N\t  +  1)  be  a  {0,  l}-valued  rv  that  encodes  packet  generation  by  source  i. 
Moreover,  let  R[N\t  +  1)  represent  the  possibility  that  the  packet  generated  by 
source  i  at  the  beginning  of  timeslot  [t,t  +  1)  is  rejected,  i.e.,  R^N\t  +  1)  =  1 
(resp.  R^N\t  +  1)  =  0)  if  the  packet  is  rejected  by  (resp.  accepted  into)  the  RED 
buffer.  Set 

B\N)(t  +  1)  =  1  [Ui(t  +  1)  <  cq(A°(t)]  (3.1) 

where  a\N\t)  is  an  [0,  l]-valued  rv  which  denotes  the  (conditional)  transmission 
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rate  of  traffic  source  i  at  the  beginning  of  timeslot  [t,  t  +  1),  and  let 

Rf\t  +  1)  =  1  [Vi(t  +  1)  <  f{N\Q{N)(t))]  (3.2) 

denote  the  indicator  function  of  the  event  that  the  incoming  packet  from  source  i 
will  be  rejected.  Thus, 

Af\t)  =  (1  -  R\N\t  +  1  ))B(,N\t  +  1).  (3.3) 

To  select  the  transmission  rates  we  argue  as  follows:  Suppose  that  source  i 
generates  no  packet  during  timeslot  [t,t  +  1)  (t,  +  1)  =  0),  then  the 

transmission  rate  of  source  i  in  the  next  timeslot  remains  unchanged.  If  on  the 
other  hand,  a  packet  is  produced  by  source  i  at  the  beginning  of  timeslot 
[t,  t  +  1),  then  either  the  packet  is  successfully  transmitted  (. R[N\t  +  1)  =  0),  or  it 
is  dropped  ( R\N\t  +  1)  =  1).  In  the  former  case,  the  transmission  rate  of  source  i 
in  the  next  timeslot  is  increased  to  a\N\tY~e  (0  <  £  <  1),  while  this  transmission 
rate  is  decreased  by  a  factor  7  (0  <  7  <  1)  to  7 a^N\t)  in  the  latter  case. 

Under  the  constraint  that  transmission  rates  are  bounded  to  the  unit  interval, 
these  rules  attempt  to  emulate  the  additive  increase  and  multiplicative  decrease, 
respectively,  of  the  TCP  congestion-control  by  conservatively  increasing  the 
transmission  rate  if  the  transmission  is  successful  and  reducing  the  transmission 
rate  by  the  factor  7  in  the  event  of  a  packet  loss.  This  can  be  summarized  into 
the  single  equation 

of  b + 1) 

=  of  +  l))sf ’(t  +  1)  +  T +  1  )sf  >((  +  1) 

+  °Pb)(l  —  B\N\t  +  1)).  (3.4) 
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3.2  The  Weak  Law  of  Large  Numbers 


We  are  interested  in  determining  the  limiting  behavior  of  the  rate-based  model  as 
the  number  of  sources  N  becomes  large.  The  discussion  is  carried  out  under  the 
following  assumptions  (AR1)-(AR2): 

(AR1)  There  exists  a  continuous  function  /  :  M+  — >  [0, 1]  such  that  (1.1)  holds  for 
each  iV  =  1,2,...,  i.e., 

f^N\x)  —  f(N~1x),  x  >  0. 


(AR2)  For  each  iV  =  1,2,...,  the  queue  dynamics  start  with  the  conditions 
Q^n\0)  =  0  and  oW(0)  =  a,  i  =  1, . . . ,  N 
for  some  non-random  a  in  (0, 1]. 

Assumption  (AR1)  is  the  structural  condition  discussed  in  Section  1.3,  while 
(AR2)  is  made  essentially  for  technical  convenience  as  it  implies  that  for  each 
N  =  1,  2, . . .  and  all  t  =  0, 1, . . .,  the  rvs  a[N\t), . . . ,  a^\t)  are  exchangeable. 
Assumptions  (AR2)  can  be  omitted  but  at  the  expense  of  a  more  cumbersome 
discussion. 


Theorem  1.  Assume  (AR1)-(AR2)  in  the  rate-based  model  of  Section  3.1.  Then, 
for  each  t  —  0, 1, . . .,  there  exist  a  (non-random)  constant  q(t)  and  a  [0,  l]-va lued 
rv  a(t )  such  that  the  following  holds: 

(i)  The  convergence 


QW{t) 

N 


and  a[N\t)  -W  a(t) 


takes  place; 
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(ii)  For  any  integer  I  —  1,2, . . the  rvs  {a\N\t),  i  =  1, . . . ,  1}  become  asymp¬ 
totically  independent  as  N  becomes  large,  with 

i 

lim  P  a[N\t)  <  Xi,  i  =  1, . . . ,  /  =  IT  P  [a(t)  <  xt\ 

N—>oo  L  J  -*~ 

i=  1 

for  any  X\ , ,xj  in  [0, 1]. 

(Hi)  For  any  p  >  0, 

^  E2,i(“iiV)(*))p  A  E  [Q'(i)’’] .  (3.5) 

Moreover,  with  initial  conditions  q( 0)  =  0  and  a(0)  =  a,  it  holds  that 

q(t+l)  =  [q(t)  -  C+  (1  -  f(q(t)))E  [a(t)}]+  (3.6) 

and 

a(t  +  1)  =st  ci(t)1  "(l  —  R{t  +  1  ))B(t  +  1) 

+  ya(t)R{t  +  1  )B(t  +  1) 

+  ci(t)(  1  —  B(t  +  1)),  (3-7) 

where 

B(t  +  1)  =  1  [U(t  +  1)  <  a(t)}  (3.8) 

and 

R(t  +  l)  =  l[V(t  +  l)<f{q(t))]  (3.9) 

for  i.i.d.  [0, 1  }-uniform  rvs  {U(t  +  1),  V(t  +  1),  t  —  0, 1, . . .}. 

A  proof  of  Theorem  1  is  available  in  [59] . 

Theorem  1  suggests  that  during  timeslot  [t,  t  +  1)  a  bottleneck  queue  driven 
by  a  random  dropping  algorithm  under  a  large  number  of  TCP  sources  can  be 
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characterized  by  a  two-dimensional  recursion  for  the  limiting  normalized  queue 
length  q(t)  and  the  limiting  transmission  rate  a(t).  The  convergence  result 
jy  q{t )  is  a  byproduct  of  the  convergence 

°  A  a(t)  (3.10) 

with 

a(t)  =  0--  f(Q(t)))E[a(t)]. 

This  result,  while  similar  to  the  classical  Weak  Law  of  Large  Numbers,  cannot 
be  obtained  by  a  straightforward  application  of  the  classical  Law  of  Large 
Numbers.  Indeed,  the  summands  in  (1.8)  (under  (AR2))  are  identically 
distributed  but  correlated  rvs  whose  common  distribution  varies  with  N. 

However,  as  the  number  of  sources  increases,  the  dependency  between  any  pair  of 
sources  becomes  weaker  so  that  the  aggregate  behavior  eventually  becomes 
deterministic.  Since  the  transmission  rates  are  random  and  asymptotically 
independent,  Theorem  1  provides  some  indications  that  the  transmission  rates 
among  all  flows  are  no  longer  synchronized  when  the  number  of  flows  is  large.  It 
also  helps  justify  the  use  of  micro-scale  TCP  flow  models  because  any  single  flow 
has  no  impact  to  the  global  behavior  of  the  system  -  The  behavior  of  a  flow  can 
be  decoupled  from  the  system  and  the  model  only  needs  to  take  into  account  the 
reaction  of  a  flow  to  the  system  but  not  vice  versa. 

One  advantage  of  modeling  a  RED  gateway  over  a  Tail-Drop  gateway  is  that 
in  RED  gateway,  there  is  a  fixed  orderly  structure  on  how  the  packets  are  being 
marked/ dropped  depending  on  the  probability  function.  In  a  Tail-Drop  gateway, 
it  is  more  difficult  to  accurately  model  how  incoming  packets  are  dropped,  for 
such  events  usually  depend  on  the  precise  timing  of  packets  arrivals  and 
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departures.  This  absence  of  randomness  in  the  Tail-Drop  gateway  can  also  give 
rise  to  a  complex  phenomenon  such  as  chaotic  behavior  as  suggested  in  [61]. 

3.3  Simulations  of  the  Rate-based  Model 

In  this  section,  we  present  the  results  from  Monte-Carlo  simulations  of  the  ersatz 
model  presented  in  Section  3.1  to  demonstrate  the  asymptotic  behavior  derived 
in  Theorem  1.  We  simulate  the  system  for  N  =  10, 100, 1000  with  £  =  0.1, 

7  =  0.5  and  C  =  0.5;  the  initial  conditions  are  Q(Ar)(0)  =  0  and  =  0.5  for  all 
i  —  1,  ,.,N.  The  drop  probability  is  calculated  through  the  piecewise  linear 
function  /  :  M+  — ►  [0, 1]  given  by 

r 

0  x  <1 

f(x)  =  1  <  x  <  5  (3.11) 

1  5  <  x. 

The  simulation  results  are  shown  in  Figure  3.1  and  3.2.  It  is  clear  that  the 
fluctuation  of  Q^N\t)/N  decreases  as  the  number  of  sources  increases,  and  the 
same  is  true  for  the  average  transmission  rate.  With  a  hundred  or  more  flows, 
our  analytical  result  seems  to  hold  reasonably  well.  Moreover,  this  simulation 
result  also  suggests  the  existence  of  the  steady-state,  which  happens  quickly  after 
only  around  a  hundred  iterations.  Similar  results  obtained  from  a  full-scale 
simulation  of  the  protocol  stacks  in  NS  [45]  are  presented  in  the  next  chapter. 
These  preliminary  findings  comfort  our  belief  that  the  ersatz  model  captures 
some  of  the  essential  features  of  TCP  and  RED. 

We  are  keenly  aware  that  the  ersatz  proposed  here  fails  to  incorporate  some 
key  features  of  TCP  and  RED,  most  notably  round  trip  delays  and  queue 
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Figure  3.1:  The  normalized  queue  length  of  Simulation  1. 
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Figure  3.2:  The  average  transmission  rate  per  user  of  Simulation  1. 

averaging.  It  is  however  noteworthy  that  despite  such  omissions,  the  asymptotics 
derived  from  this  simplified  model  are  qualitatively  in  line  with  the  results 
obtained  via  the  NS  simulator  (which  emulates  in  great  details  the  TCP  stack). 
Thus,  this  qualitative  convergence  points  to  the  possibility  of  creating  simple  and 
yet  accurate  models  that  capture  key  interactions  between  the  TCP  and  RED 
mechanisms  in  the  presence  of  a  large  number  of  TCP  connections.  Indeed,  a 
more  accurate  model  with  a  similar  asymptotic  behavior  is  presented  in  Chapter 
4. 
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Chapter  4 


A  Window-based  Model  of  Persistent  TCP  Population  with 
Homogeneous  Round-trip  Delays 


In  this  chapter,  we  embellish  the  rate-based  model  presented  in  Chapter  3  to 
fully  incorporate  the  window  mechanism  of  TCP  congestion-control.  As  a  result, 
each  TCP  flow  can  transmit  more  than  one  packet  per  timeslot  unlike  the  ersatz 
model  in  Chapter  3.  Another  major  difference  is  that  TCP  flows  presented  here 
use  marked  packets  (e.g.,  by  Explicit  Congestion  Notification  [15])  instead  of 
dropped  packets  as  a  signaling  mechanism  for  congestion-control.  We  take  this 
approach  for  two  reasons:  (i)  Since  marking  reduces  the  amount  of  unnecessary 
retransmissions  and  produces  more  desirable  properties  in  network  traffic,  it  is  a 
signaling  mechanism  of  choice  for  AQM  traffic  management;  (ii)  The  actual 
window  mechanism  for  TCP  regarding  dropped  packets  is  complicated  as  the 
TCP  source  needs  to  receive  triple-duplicated  acknowledgments  before  triggering 
the  congestion-control  mechanism,  hence  the  size  of  the  congestion  window  and 
the  order  of  the  dropped  packets  in  the  timeslot  can  cause  different  behavior. 

For  this  model,  we  establish  a  Weak  Law  of  Large  Numbers  in  Theorem  2,  a 
Central  Limit  Theorem  (CLT)  in  Theorem  3  and  the  steady-state  analysis,  all  of 
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which  reveal  several  interesting  points: 


(i)  Theorem  2  shows  that  the  dynamics  of  the  queue  at  time  t,  still  denoted 
QW(t),  can  be  approximated  by  Nq(t )  with  q(t)  determined  via  a  simple 
deterministic  recursion,  which  is  independent  of  the  number  of  users.  This 
approximation  becomes  more  accurate  as  the  number  of  users  becomes  large. 
The  limiting  model  is  therefore  “scalable”  as  it  does  not  suffer  from  state 
space  explosion,  nor  does  it  require  any  ad-hoc  assumptions. 

(ii)  Theorem  2  also  shows  that  the  dependency  between  each  TCP  flow  becomes 
negligible  under  a  large  number  of  flows,  i.e.,  “RED  breaks  the  global  syn¬ 
chronization  when  the  number  of  flows  is  large.”  More  specifically,  the  win¬ 
dow  sizes  of  all  users  are  asymptotically  independent  and  random,  hence  the 
aggregate  traffic  becomes  smooth  unlike  in  Tail-Drop  gateways  where  syn¬ 
chronization  between  flows  causes  the  aggregate  traffic  to  exhibit  the  saw¬ 
tooth  pattern. 

(iii)  The  bottleneck  capacity  being  NC,  the  average  throughput  of  each  flow  is 
approximately  C  for  large  N,  so  that  TCP  traffic  does  not  follow  a  commonly 
held  belief  associated  with  statistical  multiplexing.  Indeed,  for  N  open-loop 
independent  traffic  sources  operating  at  average  rate  C,  statistical  multiplex¬ 
ing  suggests  that  the  bandwidth  required  to  serve  the  aggregate  flow  is  less 
than  NC.  TCP  flows,  on  the  other  hand,  are  correlated  due  to  their  coupling 
via  the  bottleneck  router,  and  multiplexing  TCP  flows  is  not  as  effective. 

(iv)  The  queue  length  in  steady-state  can  be  easily  calculated,  while  the  steady- 
state  distribution  of  the  window  size  and  the  average  window  size  can  be 
evaluated  from  well-known  TCP  models. 
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(v)  For  a  more  accurate  description  of  the  queue  dynamics,  a  Central  Limit 
Theorem-type  analysis  (summarized  in  Theorem  3)  yields  the  existence  of  a 
rv  L0(t)  such  that  the  approximation  (1.7)  holds. 

(vi)  This  CLT  analysis  also  reveals  that  the  magnitude  of  queue  fluctuations  is 
proportional  to  the  derivative  of  the  marking  probability  function.  This  ad¬ 
vocates  the  use  of  a  smooth  marking  probability  function  in  line  with  the 
RED  “gentle”  option  recently  suggested  as  opposed  to  the  original  recom¬ 
mendation  in  [16,  18].  This  finding  coincides  with  the  observed  oscillatory 
behavior  of  RED  when  the  average  packet  drop  rate  exceeds  max-p}  in  the 
absence  of  RED’s  “gentle”  modification  [14]. 

The  chapter  is  organized  as  follows.  In  Section  4.1,  the  model  is  described  in 
details,  and  a  first  set  of  asymptotic  results  are  presented  in  Section  4.2.  Section 
4.3  focuses  on  the  calculations  of  the  limiting  normalized  queue  size  and  the 
average  window  size  in  steady  state.  Section  4.4  contains  a  Central  Limit 
Theorem  complement  to  the  asymptotic  results  of  Section  4.2.  Simulation  results 
confirming  the  theoretical  results  are  shown  in  Section  4.5.  Applications  to 
network  dimensioning  and  to  marking  probability  function  design  are 
demonstrated  in  Section  4.6. 


4.1  Model  Description 

Overview  of  the  dynamics 

The  TCP  congestion-control  algorithm  dynamically  adjusts  the  size  of  the 
congestion  window  (the  amount  of  unacknowledged  packets  in  the  network  per 
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round-trip)  by  the  following  mechanism  (assuming  ECN/RED  is  utilized  at  the 
bottleneck  node):  If  all  the  packets  transmitted  in  a  round-trip  are  not  marked, 
then  the  size  of  the  congestion  window  is  increased  by  one  packet  for  the  next 
round-trip.  On  the  other  hand,  if  at  least  one  packet  is  marked  in  the  round-trip, 
the  congestion  window  is  halved.  The  probability  that  the  router  will  mark  an 
incoming  packet  in  the  buffer  depends  on  the  average  queue  length  at  the  time  of 
its  arrival.  The  average  queue  length  is  calculated  by  an  exponential  weighted 
average  hlter  with  large  time-constant  to  prevent  RED  from  reacting  too  fast.  As 
a  result,  consecutive  incoming  packets  into  RED  are  marked  with  almost 
identical  probability.  With  this  in  mind,  we  now  construct  a  model  whose 
dynamics  are  similar  in  spirit  to  the  dynamics  of  TCP  +  ECN/RED. 

The  discrete-time  model 

Similarly  to  the  model  in  Chapter  3,  time  is  assumed  discrete  and  slotted  in 
contiguous  timeslots  of  duration  equal  to  the  round-trip  delay  of  TCP 
connections.  We  consider  N  traffic  sources,  all  transmitting  through  a  bottleneck 
RED  gateway  with  ECN  enabled  in  both  TCP  and  RED.  The  capacity  of  this 
bottleneck  link  is  NC  packets/slot  for  some  positive  constant  C.  The  RED  buffer 
is  modeled  as  an  infinite  queue,  so  that  no  packet  losses  occur  due  to  buffer 
overflow,  and  congestion-control  is  achieved  solely  through  the  random  marking 
algorithm  in  the  RED  gateway. 

Dynamics 

Fix  N  =  1,  2, . . .,  and  suppose  that  each  of  the  N  sources  (i.e.,  TCP  connections) 
has  an  infinite  amount  of  data  to  transmit  and  that  in  each  timeslot  it  transmits 
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as  much  as  allowed  by  its  congestion  window  in  that  timeslot.  So,  for 
%  —  1, . . . ,  N,  let  W-N\t)  be  an  integer- valued  rv  that  encodes  the  number  of 
packets  generated  by  source  i  (and  hence  its  congestion  window)  at  the  beginning 
of  the  timeslot  [t,t  +  1).  We  assume  the  integer  w}N\t)  to  be  in  the  range 
{1, . . . ,  Wmax}  for  some  finite  integer  Wmax.  Throughout  we  assume  >  2  to 

avoid  boundary  cases  of  limited  interest. 

Upon  arrival  at  the  RED  gateway,  each  packet  from  source  i  may  be  marked 
according  to  a  random  marking  algorithm  (to  be  specified  shortly).  We  represent 
this  possibility  by  the  {0,  l}-valued  rv  M-j\t  +  1)  (j  =  1, ...,  W^N\t))  with  the 
interpretation  that  M-J\t  +  1)  =  0  (resp.  M-J\t  +  1)  =  1)  if  the  jth  packet 
from  source  i  is  marked  (resp.  not  marked)  in  the  RED  buffer.  Given  that  N 
sources  are  active,  the  total  number  of  packets  which  are  accepted  into  the  RED 
buffer  at  the  beginning  of  timeslot  [t,t  +  1)  is  given  by  (1.8),  where  in  this  model 

Af\t)  :=  H/f »(«),  (4,1) 

Again,  the  dynamics  of  the  queue  size  is  given  as  (1.9),  which  can  be 
rewritten  as 


Q{N\t  +  1) 


N 


+ 


Q(N>(t)-NC  +  ^W^\t) 

i= 1 


(4.2) 


Statistical  assumptions 

In  order  to  fully  specify  the  model,  we  need  to  specify  the  joint  statistics  of  the 
rvs  +  1),  i  —  1, . . . ,  N;  j  =  1,  2, ... ;  t  =  0, 1, . . .}.  To  do  so  we 

introduce  the  collection  of  i.i.d.  [0, 1  }-uniform  rvs 

{Vi(t  +  1),  Vij(t  +  1),  i,j  —  !,•••;  t  —  0, 1, . . .}  which  are  assumed  independent 
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of  the  rvs  A^(0), . . . ,  kL^(O)  and  (2^(0).  We  also  introduce  a  mapping 
f(N^  :  M+  — >  [0, 1]  which  acts  as  the  marking  probability  function  of  the  RED 
gateway. 

The  process  by  which  packets  are  marked  is  described  first:  For  each 
i  —  1, . . . ,  N,  we  define  the  marking  rvs 

M%\t  + 1)  =  l[Vid{t+l)>fW{QW(t))],.  j  =  1,2,...  (4.3) 

so  that  the  rv  M^\t  +  1)  is  the  indicator  function  of  the  event  that  the  jth 
packet  from  source  i  is  not  marked  in  the  timeslot  [t,t  +  1).  Thus,  in  a 
round-trip,  each  packet  coming  into  the  router  is  marked  with  identical 
(conditional)  probability  which  depends  only  on  the  queue  length  at  the 
beginning  of  the  timeslot.  This  model  approximates  the  case  where  the  memory 
of  the  queue  averaging  mechanism  is  long,  which  is  the  case  for  the  recommended 
parameter  settings  of  RED  [16] . 

Next  we  introduce  the  rvs 

4N\t) 

Mf\t+ 1)=  n  V«f((  +  1)  (4.4) 

3= 1 

so  that  M-N\t  +  1)  =  1  (resp.  M-N\t  +  1)  =  0)  corresponds  to  the  event  that  no 
packet  (resp.  at  least  one  packet)  from  source  i  has  been  marked  in  timeslot 
[' t,t+  1).  Recall  that  in  this  model,  A[N\t)  =  W-N\t),  i.e.,  each  connection 
injects  as  many  packets  into  the  network  as  its  congestion  window  will  allow. 

The  evolution  of  the  window  mechanism  for  source  i  can  now  be  described 
through  the  recursion 

wf°(t  + 1)  =  mm(w^N\t)  +  l,Wma^M^N\t+l)  (4.5) 

+  min  ( f W'  2  ^  ] ,  Wm ax^  (1  -  MlN) (; t  +  1)). 


49 


This  equation  emulates  the  interaction  between  TCP  and  RED  as  follows:  If  no 
packet  from  source  i  is  marked  in  the  timeslot  [t,t  +  1),  then  the  congestion 
window  size  in  the  next  timeslot  is  increased  by  one  packet.  On  the  other  hand, 
if  one  or  more  packets  are  marked  in  the  timeslot  [t,t  +  1),  then  the  congestion 
window  in  the  next  timeslot  is  reduced  by  half.  The  size  of  the  congestion 
window  is  limited  by  the  maximum  window  size  IEmax  1. 

4.2  The  Asymptotics 

The  first  result  of  this  model  consists  in  the  asymptotics  for  the  normalized 
buffer  content  as  the  number  of  TCP  flows  becomes  large.  This  result,  contained 
in  Theorem  2,  is  discussed  under  the  following  assumptions  (AW1)-(AW2): 

(AW1)  There  exists  a  continuous  function  /  :  M+  — >  [0, 1]  such  that  for  each  N  = 
1,2,.. .,  Equation  (1.1)  holds. 

(AW2)  For  each  N  =  1,2,...,  the  dynamics  (4.2)  and  (4.5)  start  with  the  conditions 

Q(JV)  (0)  =  0  and  IE jN)  (0)  =  IE,  i  =  1, . . . ,  N 

for  some  integer  IE  in  the  range  {1, . . . ,  IEmax}. 

Note  that  these  assumptions  are  similar  in  nature  to  Assumption 
(AR1)-(AR2)  in  Chapter  3. 

1Note  also  that  if  W-  N->  (0)  lies  in  the  range  {1, . . . ,  IEmax}  for  each  i  =  1, ...  ,1V,  then  so  does 
W}N\t)  for  each  t  =  0,1,...  and  the  minimum  with  IEmax  in  the  second  term  of  (4.5)  can  be 
omitted. 
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Theorem  2.  Assume  (AW1)-(AW2)  to  hold.  Then ,  for  each  t  =  0,1,...,  there 
exist  a  (non-random)  constant  q(t )  and  an  {1, . . . ,  Wmax}-valued  rv  W ( t )  such  that 
the  following  holds: 

(i )  The  convergence  results 

—  ~*n  q(t )  and  w[N\t)  => N  W(t )  (4.6) 

take  place; 

(ii)  For  any  function  g  :  N  — >  M, 

i,  Ei.s( Wf°(*))  E  {g(W(m  .  (4.7) 

(Hi)  For  any  integer  I  =  1,  2, . . .,  the  rvs  {W^N\t),  i  =  1, .  . . ,  1}  become  asymp¬ 
totically  independent  as  N  becomes  large,  with 

i 

lim  P  \wt{N\t)  =  h,  i  =  1, . . . ,  i]  =  ff  P  [W(t)  =  h]  (4.8) 

N—>oo  L  J  -a--a- 

i=  1 

for  any  ki, ...  ,ki  in  N 

Moreover,  with  initial  conditions  q( 0)  =  0  and  0)  =  W ,  it  holds  that 

q(t+l)  =  (q(t)-C  +  E[Wm+  (4-9) 

and 

W(t  +  1)  min  (W(t)  +  1,  Wmax)  M(t  +  1)  (4.10) 

+  min  ^ [ -yp-] ,  Wma^J  (1  -  M {t  +  1)), 

where 

M(t  +  1)  =  1  [V(t  +  1)  <  (1  -  f(qmW{t) }  (4.11) 

for  i.i.d.  [0, 1  ]-uniform  rvs  {V(t  +  1),  t  =  0, 1, . . .}. 
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A  proof  of  Theorem  2  is  available  in  Chapter  6.  As  should  be  clear  from  the 
discussion  given  there,  Theorem  2  readily  flows  from  a  Weak  Law  of  Large 
Numbers  (4.7)  for  the  triangular  array 

{' WlN\t ),  i  —  1, . . . ,  N]  N  —  1,2,...}.  (4.12) 

The  numerical  calculations  for  the  limiting  model  are  very  simple.  The 
number  of  states  required  for  the  calculation  for  each  time  step  is  only  IFmax  +  1 
regardless  of  N.  For  each  t  =  0, 1, . . . ,  we  can  determine  q(t)  through  the 
following  steps: 

(i)  For  t  —  0,  start  with  some  given  values  g(0)  =  0  and  W{ 0)  =  W,  i.e., 

P  [W (0)  =  j]  =  5(j,  IF),  j  =  1, ,  lFmax,  and  use  E  [IF (0)]  =  IF  to  calculate 
q(  1)  via  (4.9); 

(ii)  Given  q(t )  and  P[IF(f)  =  j],  j  =  1 ,  . . . ,  lFmax ,  for  some  t  =  0,1,...,  use 
(4.10)  and  (4.11)  with  q{t)  to  calculate  the  transition  probabilities  and 

P  [IF (t  +  1)  =  j] ,  j  —  1, . . . ,  lFmax.  Then  calculate  E  [IF ( t  +  1)] ; 

(iii)  Lise  E  [W(t  +  1)]  in  (ii)  to  update  q(t  +  2)  from  (4.9); 

(iv)  Increase  t  by  one  and  repeat  Step  (ii)-(iv). 

Examples  of  the  numerical  calculation  of  the  limiting  model  will  be  presented 
in  Section  4.5. 


4.3  Steady-state  Regime 

We  now  turn  our  attention  to  the  steady  state  regime  of  the  limiting 
two-dimensional  recursion  (4.9)-(4.11),  more  specifically  to  the  calculation  of  the 
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limiting  queue  and  average  window  size  in  statistical  equilibrium,  i.e.,  large  t 
asymptotics.  We  show  that  there  is  a  close  relationship  between  this  limiting 
behavior  in  steady  state  and  the  now  standard  TCP  throughput  model  with 
stationary  loss  developed  in  [47,  48]: 

Throughout,  we  assume  the  following  assumptions  (AW3)-(AW4),  where 
(AW3)  The  marking  function  /  :  M  — »  [0, 1]  is  increasing  with 

/( 0)  =  0  and  lim  f(x)  =  1; 

XX 

(AW4)  The  sequence  {(g(f),  W(t)),  7  =  0,1,...}  admits  a  steady  state  in  the  sense 
that 

(q(t),W(t))^t  (q* ,  W*) 

for  some  rvs  (g*,  W*)  where  q*  is  non-random  and  W*  is  an  {1, ... ,  Wm aY}~ 
valued  rv. 

Although  the  two-dimensional  sequence  {(g(f),  W(t)),  t  —  0, 1, . . .}  is  a 
time-homogeneous  Markov  chain  with  values  in  M+  x  {1, . . . ,  Wmax } ,  we  shall  not 
address  here  the  existence  of  the  limit  posted  in  (AW4)  as  complications  arise 
due  the  fact  that  the  first  component  is  degenerate  (i.e.,  deterministic). 
Control-theoretic  analyses  (as  surveyed  in  Section  2.2)  suggest  that  the  existence 
of  the  limit  in  the  sense  of  (AW4)  depends  on  the  parameter  settings  of  RED.  If 
the  parameters  are  set  appropriately,  then  the  queue  will  stabilize  to  its 
fixed-point  solution.  Otherwise,  the  steady-state  queue  can  fluctuate  around  its 
fixed-point  solution  (due  to  border  collisions)  or  can  even  exhibit  chaotic 
behavior.  In  any  case,  only  the  steady-state  in  the  sense  of  (AW4)  is  desirable. 
Discussions  on  the  parameter  settings  in  RED  to  ensure  (AW4),  i.e.,  stability, 
can  be  found  in  Section  2.2. 
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However,  that  under  the  condition  ITmax  <  C,  it  is  a  simple  matter  to  check 
that  (AW4)  holds  with  q*  =  0  and  W*  =  Wmax  as  would  be  expected;  related 
details  are  given  in  Section  4.3.2  (Case  1).  Thus,  only  the  case  C  <  Wmax  needs 
to  be  considered. 

4.3.1  TCP  throughput  with  fixed  loss  probability 

In  order  to  discuss  the  limit  (q*,  IT*)  postulated  in  (AW4),  we  first  need  to 
review  the  results  from  the  TCP  throughput  model  with  stationary  loss  [47,  48]: 

Fix  7  in  [0, 1],  where  7  =  1  —  p  with  p  interpreted  as  the  stationary 
dropping/marking  probability.  Consider  the  recursion 

Wo7  =  Wo] 

=  min  (WZ  +  1,  IfJ  M„y,  +  min  U-jU  »»,„)  (1  -  M’+1) 

for  all  n  —  0, 1, . . .  and  some  (1, . . . ,  valued  rv  IT0 ,  with 

M:+1  =  1  [K+1  <  7^]  ,  n  =  0,1,...  (4.13) 

where  the  rvs  {Vn+i,  n  —  0, 1, .  . .}  are  i.i.d.  [0,  l]-uniform  rvs  which  are 
independent  of  IT0. 

The  rvs  [W^,  n  —  0, 1, . . .}  form  a  time-homogeneous  Markov  chain  on  the 
finite  set  (1, . . . ,  Wmax}.  For  7  in  the  open  interval  (0, 1),  this  chain  is  irreducible, 
positive  recurrent  and  aperiodic,  thus  ergodic.  Consequently, 

W^  =► „  W1  (4.14) 

for  some  (1, . . . ,  IFmax }-valued  rv  IT7.  This  rv  IT7  satishes  the  distributional 
equation 

IT7  min  (IT7  +  1, ,  Wmax)  M7  +  min  ^[y],^  (1  -  M7)  (4.15) 
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where 


M1  —  l[V  <  7W7]  (4.16) 

for  some  [0,  l]-uniform  rv  V  which  is  independent  of  the  rv  IF7.  In  fact,  the 
ergoclicity  of  the  Markov  chain  guarantees  that  the  equation  (4.15)-(4.16)  has  a 
solution  and  that  this  solution  is  unique. 

For  the  sake  of  completeness,  we  also  consider  the  boundary  cases:  For  7  =  0 
(resp.  7  =  1),  it  is  easy  to  see  that  (4.14)  also  takes  place  with  W 7  =  1  (resp. 

W 7  =  Il'max)-  The  converse  is  also  true  as  we  now  demonstrate:  If  W 7  =  1  under 
(4.14),  then  (4.15)-(4.16)  read 

1  =st  min  (1  +  M7,  IFmax) 

with  M7  =  1  [V  <  7],  so  that  necessarily  M7  =  0  under  (AW4),  whence  7  =  0. 

On  the  other  hand,  if  W1  =  Wmax  under  (4.14),  then  (4.15)-(4.16)  now  reduce 
to 

Wmax  =st  VFmaxM7  +  r^l(l  -  M7) 

with 

M7  =  1[V  <  7W/max]  . 

Consequently, 

L%ij  =,t  m\ 

so  that  M7  =  1,  whence  7  =  1. 

Although  we  can  numerically  compute  the  steady-state  distribution 
determined  by  (4. 15)- (4. 16)  (which  is  a  special  case  of  [47]),  we  take  notice  that 
this  model  is  actually  that  of  a  single  TCP  connection  with  a  constant  loss 
probability  1  —  7.  This  is  a  well-studied  problem  (e.g.,  [40,  48])  with  known 
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results.  If  we  replace  [ArJ  in  (4-15)  with  then  we  can  invoke  Eqn.  (33)  in 
[48]  to  get  the  approximation 

B  \ym]  ~  min  .  (4.17) 

4.3.2  Steady-state  regime  for  the  model  in  Section  4.1 

Under  Assumption  (AW4),  it  is  a  simple  matter  to  see  that 

Wit),  M(t  +  1))  =>t  (q*,  W*,  M*) 

with 

M*  =  1  [V  <  (1  -  f(q*))W *]  (4.18) 

where  the  [0, 1] -uniform  rv  V  is  independent  of  the  pair  [q*,  W*). 

Upon  letting  t  go  to  infinity  in  (4.9)  and  (4.10),  we  obtain  the  relations 

q*  =  (q*  -C  +  E[W*})+  (4.19) 

and 

/  t-y  *  \ 

w*  =st  mm(W*+l,Wmax)M*  +  mrnl\—],WmaxUl-M*).  (4.20) 

With  g*  given,  the  solution  to  (4.20)  with  (4.18)  exists  and  is  unique;  it  is  in  fact 
given  by 

W*  =  W7  with  7  =  1-  f(q*) 

where  the  rv  IU7  is  defined  through  (4.14).  Several  cases  are  possible  when 
considering  (4.19)  and  (4.20). 

Case  1  -  f(q*)  =  0:  Then,  q*  is  finite  under  (AW3),  M*  =  1  and  (4.20) 
reduces  to 

W*  —st  min  (W*  +  1,  IUmax) 
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with  unique  solution  W*  =  Wmax  (in  agreement  with  the  discussion  in  Section 
4.3.1).  In  that  case,  (4.19)  gives 

q*  =  (q*-C+Wmax)+.  (4.21) 

Therefore,  either  q*  =  0  in  which  case  bbmax  <  C  as  should  be  expected,  or  q*  >  0 
(still  with  f(q*)  =  0),  in  which  case  C  =  bbmax •  However,  under  the  condition 
Wmax  <  C,  it  is  clear  that  if  0)  =  Q  >  0  for  all  N  —  1,  2, . . .,  then 
Q{N\t)  <  Q  and  the  conclusion  q(t)  =  0  holds  for  all  t  —  0, 1, . . .,  so  that  q*  =  0. 
In  other  words,  if  q*  in  (AW4)  is  such  that  f(q*)  =  0,  then  necessarily  q*  =  0.  □ 

Case  2  -  f(q*)  =  1:  Then  q *  >  0,  M*  =  0  and  (4.20)  now  reduces  to 

W  —st  min  (  [  ,  Wmax\ 

with  only  solution  W*  =  1  (also  in  agreement  with  the  discussion  in  Section 
4.3.1).  □ 


Case  3  -  0  <  f(q*)  <  1:  Then,  0  <  q*  <  oo  and  from  (4.19)  it  is  necessarily 
the  case  that  E  [W*]  =  C.  Thus,  the  existence  of  a  steady  state  requires  at  the 
very  least  that  the  equation 

E  [bb7]  =  C,  7  £  [0, 1]  (4.22) 

has  a  unique  solution,  say  7*,  in  which  case  we  must  have 

7*  =  1  -  f(q*). 

By  known  results  on  finite  state  Markov  chains,  the  mapping  7  — >  E  [bb7]  is 
continuous  on  [0, 1]  [39]  with 

E  [bb7]7=0  =  1  and  E  [bb7]7=1  =  bbmax. 
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By  continuity,  {E  [IE7] ,  7e  [0, 1]}  must  contain  the  interval  [1,  Wmax] .  On  the 
other  hand,  it  is  always  the  case  that 

1  <  E  [10 '1  <  i'i'niax ,  7  £  [0,  1], 

and  we  conclude  that  {E  [IE7] ,  7  G  [0, 1]}  =  [1,  fEmax].  Therefore,  there  exists  at 
least  one  solution  to  (4.22)  provided  1  <  C  <  lEmax.  The  uniqueness  of  the 
solution  would  be  ensured  by  the  strict  monotonicity  of  the  mapping 
7  — >  E  [IE7].  Although  we  have  not  been  able  to  establish  the  monotonicity,  it  is 
suggested  by  the  approximation  (4.17)  and  also  from  the  simulation  result 
illustrated  in  Figure  4.1.  □ 


Figure  4.1:  The  mapping  7  — »  E  [IE7]  when  TEmax  =  5  vs.  the  approximation 
(4.17). 

4.3.3  Discussion 

The  preceding  analysis  demonstrates  that  the  resulting  steady-state  queue  and 
congestion  window  can  be  determined  by  the  steady-state  marking  probability 


f(q*)-  We  now  discuss  different  cases  that  arise  depending  on  the  value  f{q*)- 

The  trivial  case  f(q*)  =  0  is  possible  if  and  only  if  the  maximum  window  size 
Vhmax  <  C,  or  equivalently,  no  congestion  exists  and  congestion-control  is  not 
necessary.  On  the  other  hand,  when  f(q*)  =  1  the  window  size  will  be  reduced  to 
its  minimum  value  (which  is  one  packet  in  this  model).  Even  then,  the  congestion 
is  so  severe  that  the  limiting  queue  will  grow  unboundedly  large.  This  case  is 
possible  if  and  only  if  C  <  1,  and  this  is  clearly  not  a  desirable  operating  regime 
for  the  system. 

In  the  non-trivial  case  where  0  <  f(q*)  <  1,  the  average  throughput  will  be 
equal  to  the  capacity  per  flow  C.  Furthermore,  E  [hF1_fW)]  —  (j  where 
is  the  steady-state  rv  of  the  Markov  chain  in  (4.15)  when  7  =  1  —  f{qk).  In  other 
words,  the  throughput  of  a  flow  in  steady  state  can  be  evaluated  as  a  TCP  flow 
with  fixed-loss  probability  f(q*)-  Conversely,  assuming  C  is  known,  the 
(assumed)  monotonicity  of  the  mapping  7  — >  E  [W7]  along  with  (4.22)  can  be 
used  to  evaluate  the  steady-state  marking  probability  f(q*)  and  the  steady-state 
queue  length  q *  (assuming  /  is  invertible).  This  application  to  network 
dimensioning  is  discussed  in  more  details  in  Section  4.6. 

Finally,  we  note  that  the  steady-state  analysis  implies  that  the  behavior  of 
any  single  TCP  flow  is  decoupled  from  the  dynamics  of  the  system  when  the 
number  of  flows  is  large,  i.e.,  the  behavior  of  a  single  flow  no  longer  affects  the 
dynamics  of  the  system.  This  enables  the  performance  analysis  of  the  system  to 
be  done  at  two  levels:  (i)  From  a  user’s  perspective,  the  network  only  generates  a 
fixed  marking/signaling  rate  to  a  TCP  flow  and  the  performance  of  a  TCP  flow 
can  be  evaluated  from  the  TCP  throughput  formula  such  as  in  [48];  and  (ii)  From 
the  network  operator’s  perspective,  the  aggregate  traffic  and  its  interaction  with 
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AQM  can  be  easily  predicted  without  having  to  consider  any  particular  flows. 


4.4  A  Central  Limit  Theorem 


In  this  section,  we  present  a  Central  Limit  Theorem  (CLT)  which  complements 
the  limiting  results  obtained  earlier.  The  discussion  is  carried  out  in  the  setup  of 
Section  4.2,  but  with  Assumption  (AW1)  strengthened  to  read  as  Assumption 
(AW lb),  where 


(AWlb)  Assumption  (AW1)  holds  with  continuously  differentiable  mapping  /  : 
[0, 1],  i.e.,  the  derivative  f  :  M+  — >  M  exists  and  is  continuous. 

For  each  t  =  0, 1, . . .,  let  q(t)  and  W(t)  be  as  in  Theorem  2,  and  set 


N 


(4.23) 


and 


N 


i<a'iw  =  v(E'F'v)w-eiw/w]  ■ 


N 


(4.24) 


,  i—  1 


Theorem  3.  Assume  (AWlb)-(AW2)  to  hold.  Then,  for  each  t  =  0, 1, . . .,  there 
exists  an  M2-va lued  rv  L (t)  =  (L0(t) ,  L{t))  such  that  the  convergence 


— — 71  jv  L(() 


(4.25) 


holds.  Moreover,  the  distributional  recurrence 

0 


L0(t  +  1)  —  st  < 


K(t)  >  0 
L0(t)+L(t)  K(t)  <  0 
(L0(t)  +  L(t))+  K(t)=  0 


(4.26) 
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holds  where  we  have  set 


K(t)  =  C-  q(t)  -  E  \W(t) ] . 


(4.27) 


The  convergence  (4.25)  suggests  the  approximation  (1.7)  and 

N 

WiN) (t)  ^  NE  [W{t)\  +  y/NL{t)  (4.28) 

i= 1 

for  large  N. 

We  can  interpret  K(t)  as  the  residual  capacity  per  user  in  the  limit  in  the 
timeslot  [t,t  +1).  If  there  exists  extra  capacity  for  the  average  user  rate  to 
increase  (K(t)  >  0),  then  there  is  no  fluctuation  in  the  limiting  queue.  On  the 
other  hand,  when  there  is  congestion  (K(t)  <  0),  the  (non-trivial)  limiting 
distribution  can  be  found  recursively.  Some  technical  difficulties  arise  in  the 
special  case  K(t)  =  0. 

The  proof  of  Theorem  3  is  given  in  Chapter  7,  and  relies  on  showing  that 
some  key  convergence  statements  propagate  over  time.  If  we  specialize  (7.6),  one 
of  the  by-product  of  this  analysis,  to  the  mapping  g  :  IN  — >•  R  given  by  g(w)  =  w, 
we  find  that  L(t  +  1)  is  of  the  form 

L(t  +  1)  =*  A(t)  +  f'(q(t))R(t)L0(t)  +  H(t  +  1)  (4.29) 

where  R(t)  is  a  constant,  H(t  +  1)  is  a  zero-mean  Gaussian  rv  independent  of  the 
pair  of  rvs  (L0(t),A(t))  and  the  statistics  of  rv  A (t)  are  determined  by 

q(  o),e(i),  •■■,?(< -i)- 

From  (4.29),  we  observe  that  the  magnitude  of  the  queue  fluctuation  is 
proportional  to  the  derivative  of  /  around  the  limiting  (normalized)  queue  size 
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q(t).  Since  the  original  drop  probability  function  of  RED  is  discontinuous  around 
max-thresh,  our  analysis  is  very  much  in  line  with  the  reported  oscillatory  queue 
behavior  when  the  average  queue  exceeds  rriaxAhresh  and  with  the  hireling  that 
the  addition  of  the  “gentle”  option  can  improve  the  queue  behavior  [14]. 

To  get  an  intuition  on  why  the  derivative  of  the  marking  function  plays  such 
an  important  role,  note  that  /(— j^-)  is  the  only  feedback  information  from  the 
RED  gateway  to  the  TCP  sources.  However,  this  feedback  information  also 
fluctuates  around  its  limiting  mean  f(q(t)).  It  is  easy  to  imagine  that  the 
uncertainty  in  the  feedback  information  will  also  lead  to  fluctuations  in  the 
limiting  queue  as  is  suggested  by  the  following  result  which  is  a  well-known  result 
called  the  Delta  method  [60,  Thm.  3.1,  p.26]). 


Proposition  1.  Let  f  :  R+  — >  [0, 1]  be  a  differentiable  mapping  with  derivative 
f  :  R+  — >  R  continuous  at  x  =  q(t).  If  for  some  t  =  0,1,...,  the  convergence 


Vn 


L0(t ) 


(4.30) 


takes  place  with  some  rv  Lq  (t) ,  then 


f(q(t))L0(t). 


(4.31) 


4.5  Simulations 

In  this  section,  we  present  results  from  (Monte-Carlo)  simulations  of  the  model 
presented  in  Section  4.1  and  from  NS  simulations  [45]  to  illustrate  the  behaviors 
suggested  by  both  Theorems  2  and  3.  For  the  NS  simulations,  we  use  the  system 
shown  in  Figure  4.2.  Each  server  establishes  a  TCP  Reno  connection  to  a 


62 


corresponding  client,  thereby  competing  for  the  capacity  in  the  ECN/RED 
gateway.  Each  TCP  has  a  fixed  packet  size  of  1500  bytes  and  a  maximum 
window  size  of  200  packets.  The  marking  probability  function  in  the  ECN /RED 
gateway  is  specified  as  in  (AW1)  with  /  :  M+  — >  [0, 1]  taken  to  be 

f(x)  =  min  (O.Olf/c  —  1)  +  ,  l)  ,  x  >  0. 

We  choose  this  simple  marking  probability  function  with  only  one  major  slope  in 
order  to  later  demonstrate  the  relationship  between  the  magnitude  of  the  queue 
fluctuations  and  the  slope  of  /  as  mentioned  in  Section  4.4. 

The  “time  constant”  parameter  wq  for  the  Exponential  Weighted  Moving 
Average  is  set  to  0.002,  similar  to  the  recommended  value  in  [16].  Every  round 
equals  the  round-trip  propagation  delay  of  200  milliseconds.  At  the  beginning  of 
each  round,  we  collect  the  instantaneous  queue  length  in  the  ECN /RED  buffer 
for  a  total  duration  of  200  seconds.  Figure  4.3  shows  the  queue  length  normalized 
by  the  number  of  connection  (N)  as  a  function  of  time.  We  note  a  behavior 
similar  to  that  discussed  in  Theorem  2  as  fluctuations  in  the  normalized  queue 
length  decrease  with  an  increasing  number  of  connections.  Moreover,  Figure  4.3 
also  suggests  the  existence  of  a  steady-state  for  the  limiting  model,  with  a 
steady-state  normalized  queue  length  being  constant  at  approximately  4.85 
packets/user,  corresponding  to  the  steady-state  marking  probability  of 
0.0385  =  /(4.85). 

To  simulate  the  model  described  in  Section  4.1,  we  use  the  same  parameter 
setup  as  in  the  NS  simulation,  i.e.,  =  200,  simulation  time  of  1000  timeslots 

and  the  same  marking  function.  The  capacity  per  user  ( C )  of  the  bottleneck 
router  can  be  calculated  from  (4.17)  and  (4.22)  when  the  marking  probability  p  is 
0.0385  (obtained  from  the  NS  simulation,  so  that  7  =  1  —  p  =  .9615).  A  simple 
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Figure  4.2:  The  topology  setup  in  NS  simulation. 


Normalized  Queue  Length  from  NS-2  simulation 


Figure  4.3:  The  normalized  queue  length  of  the  ECN/RED  gateway  in  NS  simu¬ 
lation. 
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calculation  yields  C  =  6.24  packets/timeslot.  The  simulation  results  are  shown  in 
Figure  4.4.  A  quick  comparison  to  Figure  4.3  indicates  a  qualitative  similarity  in 
that  the  fluctuations  decrease  as  the  number  of  users  increases.  Further 
inspection  reveals  that  the  average  normalized  queue  length  is  around  4.93 
packets/user,  very  close  to  4.85  packets/user  produced  by  the  NS  simulation. 
Therefore,  the  limiting  stochastic  model  appears  to  capture  the  essential  behavior 
of  queue  dynamics  in  ECN/RED  gateways,  although  the  model  exhibits 
somewhat  greater  fluctuations  than  in  the  NS  simulation.  This  is  due  to  the  fact 
that  all  flows  in  the  model  are  synchronized  at  the  beginning  of  each  timeslot, 
i.e.,  they  adapt  at  the  same  time  while  in  the  NS  simulator,  the  flows  react 
asynchronously  to  the  marks  from  the  RED  gateway. 

To  gauge  the  rate  of  convergence,  we  assume  that  the  queue  is  in  steady  state 
after  the  first  100  samples  and  that  the  magnitude  of  the  queue  fluctuation 
around  its  steady  state  mean  is  Gaussian  and  ergodic.  Therefore,  the 
steady-state  standard  deviation  of  the  queue  can  be  reasonably  approximated 
from  the  sample  standard  deviation  of  the  queue  at  timeslot  101  and  beyond. 

The  comparison  between  the  sample  standard  deviation  from  the  model  and  NS 
simulations  is  displayed  in  Figure  4.5.  They  clearly  follow  a  similar  trend.  We 
also  expect  from  Theorem  3  that  the  standard  deviation  will  decrease  as  1/y/N 
for  large  N.  Let  denote  the  sample  standard  deviation  of  the  normalized 
queue  when  the  number  of  users  is  N.  We  can  see  from  Figure  4.5  that  Si/y/N 
provides  a  good  approximation  of  the  standard  deviation  Sn  for  large  N. 

To  demonstrate  the  relationship  between  the  slope  of  the  marking  probability 
function  and  the  magnitude  of  the  queue  fluctuation  (as  mentioned  in  Section 
4.4),  we  use  the  same  network  setup  as  before  but  with  a  slope  increased  ten-fold, 
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Normalized  Queue  Length  from  the  model 


Figure  4.4:  The  normalized  queue  length  of  the  model. 


i.e., 


f(x)  =  min  (0.1(a;  —  1)+,  l)  ,  x  >  0.  (4.32) 


Figure  4.6  and  4.7  show  the  simulation  results  in  the  case  of  the  steeper  function 
(4.32).  In  both  the  Monte-Carlo  and  NS  simulations  it  is  clear  that  the 
magnitude  of  queue  fluctuation  is  much  larger  than  in  the  original  simulation.  As 
the  number  of  flows  increases,  the  convergence  becomes  much  slower  than  in  the 
original  setup.  In  the  control-theoretic  view  of  [22],  this  phenomenon  can  be 
interpreted  as  the  control  system  becoming  oscillatory  with  too  large  a  feedback 
gain. 
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Comparison  of  the  standard  deviation  from  the  simulations  of  NS  and  Model 


Figure  4.5:  Sample  standard  deviation  of  the  normalized  queue. 

4.6  Simple  Network  Dimensioning 

We  briefly  discuss  how  to  apply  the  convergence  results  of  Theorems  2  and  3  to 
the  network  dimensioning  problem.  In  Section  4.3,  it  is  shown  that  the  steady 
state  throughput  of  the  limiting  behavior  can  be  calculated  from  a  well-known 
TCP  throughput  model  with  fixed  loss  probability,  e.g.,  [40,  48]. 

We  now  consider  a  simple  application  of  this  limiting  result;  An  ISP  currently 
services  up  to  N1  TCP  flows  at  peak  hour  through  an  ECN /RED  access  gateway 
connecting  to  the  core  network  with  the  link  speed  of  N\C  packets/second.  The 
network  manager  can  roughly  determine  the  buffer  utilization  in  the  ECN /RED 
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Normalized  Queue  Length  from  NS  simulation 


Figure  4.6:  The  normalized  queue  length  of  the  ECN/RED  gateway  in  NS  simu¬ 
lation  with  the  marking  probability  function  (4.32). 


68 


Normalized  queue  length  (packets/users) 


Normalized  Queue  Length 


Figure  4.7:  The  normalized  queue  length  of  the  model  with  the  marking  probability 
function  (4.32). 
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gateway  as  follows: 

(i)  Determine  the  marking  probability  per  flow  (p  —  f(q)  =  1  —  7)  from  the 
relation  C  —  E  [bb7]  by  using  a  TCP  throughput  formula  such  as  (4.17); 

(ii)  Calculate  the  limiting  queue  length  q  in  steady  state  by  solving  p  =  f(q ) 

(iii)  Approximate  the  queue  length  distribution  in  steady  state  via  the  CLT 
complement.  If  the  steady  state  exists,  the  CLT  complement  determines  the 
distribution  of  the  queue  size  fluctuations  around  q.  The  delay  and  overflow 
distributions  can  also  be  approximated  via  the  CLT  complement. 

While  these  limiting  results  apply  only  to  TCP  flows  with  identical 
round-trip,  there  are  situations  where  they  could  be  useful.  For  example,  the 
buffer  dimensioning  problem  in  an  intercontinental  Internet  link  where  it  is 
typically  a  bottleneck,  its  large  propagation  delay  dominates  the  round-trip  and 
the  number  of  flows  is  extremely  large. 
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Chapter  5 


A  Window-based  Model  with  Session-level  Dynamics  and 
Heterogeneous  Round-trip  Delays 


The  main  goal  of  this  thesis  is  in  modeling  TCP  traffic  in  a  realistic  situation 
where  there  are  many  flows  competing  for  bandwidth.  This  is  a  very  challenging 
problem  due  to  the  following  factors:  (i)  the  state  space  explosion,  (ii)  AQM 
schemes,  (iii)  session-layer  dynamics,  and  (iv)  variable  RTTs  of  TCP  flows.  The 
model  presented  in  Chapter  4  takes  into  account  the  factor  (i)-(ii).  In  this 
chapter,  we  extend  this  model  to  incorporate  the  session-level  dynamics  and 
variable  round-trip  delays  of  TCP  connections. 

Accurate  traffic  modeling  of  a  large  number  of  TCP  flows  with  session 
dynamics  and  heterogeneous  RTTs  is  extremely  difficult  due  to  the  interactions 
between  session,  transport,  and  network  layers,  notwithstanding  the  state  space 
explosion  mentioned  earlier  and  the  variable  feedback  information  rate.  As 
presented  in  the  literature  survey  in  Section  2.2,  ad-hoc  assumptions  are  typically 
required  to  make  the  analysis  tractable  under  a  certain  regime. 

In  this  chapter,  we  present  a  novel  model  which  incorporates  not  only  the 
interaction  of  the  congestion-control  mechanism  of  TCP  with  ECN /RED 
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mechanism,  but  also  session  dynamics  and  variable  RTTs  of  the  flows.  It  builds 
upon  the  approach  used  in  Chapter  3  and  4,  where  “macro-scale”  modeling  of 
aggregate  TCP  flows  can  be  developed  by  systematically  applying  limit  theorems 
to  describe  the  regime  when  the  number  of  TCP  flows  is  large. 

The  chapter  is  organized  as  follows.  We  first  present  our  extended  model  in 
Section  5.1.  Based  on  this  model,  we  establish  a  Weak  Law  of  Large  Numbers  in 
Section  5.2.  The  result  shows  that  the  queue  size  per  session  and  the  workload 
per  session  brought  in  during  a  RTT  converge  to  a  deterministic  process  as  the 
number  of  flows  increases.  We  also  demonstrate  that  the  flows  become 
asymptotically  independent,  which  supports  the  belief  that  the  RED  mechanism 
indeed  helps  break  the  synchronization  among  the  flows  suffered  by  drop-tail 
gateways.  In  Section  5.3,  we  show  that  the  limiting  model  is  also  consistent  with 
other  previously  proposed  models  in  their  respective  regimes.  When  the  capacity 
is  very  small  the  queue  behavior  approaches  that  of  a  processor  sharing  queue  and 
when  the  capacity  is  very  large,  its  behavior  is  similar  to  that  of  a  time-reversed 
shot-noise  model.  In  addition,  we  present  a  simple  analysis  on  the  buffer 
utilization  and  window  size  of  the  limiting  model  at  the  steady-state  in  Section 
5.4.  Under  mild  assumptions,  the  steady-state  analysis  suggests  that  only  the 
mean  RTT  affects  the  mean  queue  size  at  the  steady-state.  However,  the  variance 
of  the  round-trip  delay  plays  a  role  in  the  magnitude  of  the  queue  fluctuations. 
This  will  be  shown  with  the  help  of  a  Central  Limit  analysis  in  Section  5.5. 
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5.1  The  Model 


Our  model  captures  three  layers  of  dynamics,  namely  the  network,  transport,  and 
session  layers,  which  interact  with  each  other  through  mechanisms  to  be  specified 
shortly.  At  the  lowest  level,  the  network  is  simplified  to  be  a  single  bottleneck 
router,  namely,  a  RED  gateway  with  ECN  capability.  The  traffic  injected  into 
the  network  is  shaped  by  the  TCP  congestion-control  mechanism  in  the  transport 
layer,  which  reacts  to  the  marks  from  the  network.  Each  TCP  connection  is 
initiated  by  a  session,  such  session  being  either  active  or  idle.  If  a  session  is 
active,  a  hie  or  an  object  is  transferred  through  a  TCP  connection.  An  activity 
period  for  a  session  lasts  until  it  no  longer  has  any  data  to  transfer,  at  which 
time  it  goes  idle.  The  duration  of  an  idle  period  is  random  and  represents  the 
idle  time  between  consecutive  hie  transmissions.  When  a  new  hle/object  to  be 
transferred  arrives,  the  session  becomes  active  again  and  sets  up  a  new  TCP 
connection.  We  now  give  a  detailed  description  of  each  of  the  three  layers  of  the 
model  and  of  their  interactions. 

Time  is  slotted  in  contiguous  timeslots.  Here  the  RTTs  of  TCP  connections 
are  approximated  as  integer  multiples  of  timeslots,  i.e.,  a  timeslot  is  the  greatest 
common  divisor  of  the  RTTs  of  TCP  hows.  Similar  to  the  model  in  Chapter  4, 
we  fix  t  =  0, 1, ...  to  indicate  the  start  of  timeslot  [t,  t  +  1). 

Heterogeneous  round-trip  times 

Fix  i  G  A/".  We  assume  that  any  congestion-control  actions  by  TCP  hows, 
i.e.,  additive  increase  and  multiplicative  decrease,  occur  at  the  end  of  the 
round-trip.  The  RTT  of  how  i  at  timeslot  [t,  t  +  1)  is  denoted  by  the  V- valued  rv 
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D^N\t)  with  V  =  {2,3, ,  Dmax } • 1  The  bound  on  the  maximum  RTT  does  not 
constrain  the  model  because  actual  TCP  flows  also  cannot  have  larger  RTTs 
than  the  timeout  value.  We  use  /3^N\t  +  1)  to  denote  the  number  of  timeslots 
since  the  last  action  by  an  active  flow  i.  Then,  p\N\t)  evolves  according  to 

Df\t  +  1)  =  (l  +  /?f  >(t)l  >(()] )  1  [A'f ’(i)  >  o]  (5,1) 

where  X^N\t)  is  the  remaining  workload  (in  packets)  of  connection  i  at  the 
beginning  of  timeslot  [t,t  +  1).  The  rv  X-N\t)  is  greater  than  zero  only  if 
connection  i  is  active  in  timeslot  [t,t  +  1),  so  that  the  last  indicator  function  is 
one  only  if  the  connection  is  active.  This  will  be  explained  further  in  the  next 
subsection. 

Given  i  in  J\f  and  t  —  0, 1, . . .,  we  write 

Giit(a,  b)=a- 1  [/3f°  (t)  <  d\n)  (*)]  +6-1  (t)  >  d\n)  (*)]  ,  a,beR  (5.2) 

in  order  to  simplify  our  notation  later. 

Given  collections  of  M- valued  rvs  {17(f),  t  —  0, 1, . . and 
{Yi)new(t),t  =  0, 1, . . .},  we  see  that 

Yi(t  +1)  =  Gyf+ 1  (Yi(t),  Yi  new(t  +  1)) 

f  Yi:new  (t  +  1),  \t  +  1)  >  D\  \t  +  1) 

Yi(t),  otherwise. 

In  other  words,  the  value  of  Yt(t  +  1)  is  updated  to  Yi  new{t  +  1)  only  at  the  end  of 
round-trip.  Otherwise,  the  value  of  Yt(t  +  1)  remains  to  be  Y,  (t)  since  no  action 
will  be  taken  before  the  end  of  round-trip. 

1  Although  T>  does  not  include  one  in  our  model,  it  can  be  included  at  the  price  of  more 
cumbersome  proofs.  Moreover,  this  does  not  cause  any  loss  of  generality  of  the  model. 


74 


Session  dynamics 

Each  session  %  in  M  is  either  active  or  idle.  A  session  idle  at  the  beginning  of 
timeslot  [t,t  +  1)  has  no  packet  to  transmit  in  that  timeslot.  An  idle  session  in 
timeslot  [t,  t  +  1)  becomes  active  at  the  beginning  of  timeslot  [t  +  1,  t  +  2)  with 
probability  Par,0  <  Par  <  1,  independently  of  past  events.  In  other  words,  the 
duration  of  an  idle  period  is  geometrically  distributed  with  parameter  Par  (hence 
with  mean  1/Par).  This  attempts  to  capture  the  dynamics  of  connection  arrivals, 
where  the  interarrival  times  are  reported  to  be  exponentially  distributed  [53]. 2 
Let  {Ui(t),i  G  J\f ;  t  —  0, 1,  •  •  •  }  be  a  collection  of  i.i.d.  rvs  uniformly  distributed 
on  [0,  1],  and  let  1  [Ui(t  +  1)  <  Par]  be  the  indicator  function  of  the  event  that  a 
new  file/object  arrives  in  the  timeslot  [t  +  l,t  +  2)  for  an  idle  session  i. 

Let  {Fi(t),  i  G  A f]  t  —  0, 1,  •  •  • }  be  a  collection  of  i.i.d.  non- negative 
integer-valued  rvs  distributed  according  to  a  general  probability  mass  function 
(pmf)  F  on  {1,  2, . . .}.  The  workload  of  a  connection  for  session  i  that  becomes 
active  at  the  beginning  of  timeslot  [t,t  +  1)  is  given  by  Fi(t).  This  workload 
represents  the  total  number  of  TCP  segments3  the  connection  will  have  to 
transmit  during  this  activity  period.  Thus,  if  a  given  TCP  connection  is  used  to 
transfer  more  than  one  object,  this  workload  variable  Fi(t)  represents  the  total 
number  of  TCP  segments  brought  in  by  all  these  objects.  We  denote  by  A*(f)  the 
remaining  workload  (expressed  in  packets)  of  connection  i  at  the  beginning  of 
timeslot  [t,t  +  1).  Clearly,  we  have  X \{t)  =  0  if  session  i  is  idle  during  [t,t  +  1). 

2Recall  that  an  exponential  rv  X  with  parameter  a  can  be  approximated  by  |W],  which  is  a 
geometric  rv  with  parameter  p  =  1  —  e~a. 

3In  this  model  each  TCP  segment  is  transmitted  as  a  separate  packet. 
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The  evolution  of  X,  (t)  is  then  given  by  the  recursion 

xfb  +  i)  =  i[vf>(«)>o](xf,(()-4'v)w) 

+1  [xfb)  =  o]  1  \Ui(t  +  1)  <  P„]  F,(t  +  1)  ,  (5.3) 

where  A[N\t)  denotes  the  number  of  packets  injected  into  the  network  by 
connection  i  at  the  beginning  of  timeslot  [t,t  +  1).  This  will  be  explained  in  the 
next  subsection. 

When  a  new  connection  arrives,  its  RTT  is  randomly  selected,  and  the  RTT 
of  session  i  at  timeslot  [t  +  1,  t  +  2)  is  given  by 

D\N\t  +  1)  =  D?\t)  1  [xf  >(t)  >  0]  (5,4) 

+1  x\  \t)  =  0  1  [Ui(t  +  1)  <  Par]  D^new(t  +  1), 

where  the  P-valued  rvs  {Dinew{t  +  1),  t  =  1,2,...}  are  i.i.d.  rvs  which  determine 
the  RTT  of  newly  arrived  connections. 

TCP  dynamics 

For  each  i  in  A/”,  let  W-N^  (t)  be  an  integer- valued  rv  that  encodes  the  congestion 
window  size  (in  packets)  at  the  beginning  of  timeslot  [t,t  +  1).  We  assume  that 
the  rv  W^N\t)  has  range  {0, 1,  •  •  •  ,  l-'Fmax}  where  fTmax  is  a  finite  integer 
representing  the  receiver  advertised  window  size  of  the  TCP  connection  and  the 
congestion  window  size  of  an  idle  session  is  taken  to  be  zero.  When  an  idle 
session  becomes  active  at  the  beginning  of  timeslot  [t,t  +  1),  the  congestion 
window  size  of  the  TCP  connection  is  set  to  one  at  the  beginning  of  timeslot 
[t  +  1,  t  +  Di;new{t  +  1)  +  1),  where  Dt  new (t  +  1)  is  the  RTT  (determined  from 
(5.4))  for  the  new  active  session.  This  models  one  round-trip  delay  for  the 
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three-way  handshake.  We  now  describe  how  the  congestion  window  sizes  of  active 
connections  evolve. 

Each  TCP  source  transmits  as  many  of  the  remaining  data  packets  as  allowed 
by  its  congestion  window  only  at  the  end  of  the  round-trip.  We  simplify  the 
packet  transmission  in  the  round-trip  so  that  all  packets  from  a  connection  arrive 
only  in  a  single  timeslot,  rather  than  being  spread  out  throughout  a  round-trip. 
Such  simplification  can  be  justified  by  the  following  reasons: 

(i)  In  the  Internet,  most  of  the  packet  arrivals  at  a  bottleneck  are  usually  com¬ 
pressed  together  due  to  the  “ACK  compression”  phenomenon  [64],  which 
leads  to  bursty  arrivals  at  the  bottlenecks.  Hence,  modeling  the  packet  ar¬ 
rivals  over  a  RTT  as  a  batch  arrival  in  a  single  timeslot  tends  to  be  more 
accurate  than  modeling  them  as  smooth  arrivals  throughout  a  RTT. 

(ii)  Aggregating  a  round-trip  worth  of  packet  arrivals  into  a  single  timeslot  will 
result  in  burstier  traffic  from  each  flow.  This  will  cause  queue  dynamics  to 
fluctuate  more  than  having  a  smooth  arrival  pattern.  Therefore,  the  queue 
fluctuation  in  this  model  will  provide  an  upperbound  to  the  actual  queue 
with  smoother  packet  arrival  patterns. 

(iii)  The  information  used  for  control  action  at  the  RED  gateways  is  the  average 
queue  size.  Therefore,  the  difference  in  the  control  action  due  to  our  bursty 
packet  arrivals  will  be  smoothed  out  by  the  averaging  mechanism  with  long 
memory  in  RED. 

Suppose  that  connection  i  has  X-N\t)  remaining  packets  (or  workload) 
waiting  to  be  transmitted  at  the  beginning  of  timeslot  [ t,t  +  l),4  the  number  of 
4We  refer  to  a  TCP  connection  of  an  active  session  i  by  connection  i  when  there  is  no  confusion. 
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packets  transmitted  by  connection  i  at  the  beginning  of  timeslot  [t,t  +  1), 
denoted  by  A\N\t),  is  given  by 


A^\t)  =  min  [Wr)(t),Xr,(t) )  1  >  D^’it) 


r(N) 


[N)< 


m, 


m, 


(5.5) 


Note  from  (5.5)  that  a  connection  transmits  only  once  per  RTT. 

The  congestion-control  mechanism  of  TCP  operates  either  in  the  slow  start 
(SS)  or  the  congestion  avoidance  (CA)  phase.  A  new  TCP  connection  starts  in 
SS  in  order  to  quickly  gauge  the  available  bandwidth  of  the  network.  While  in 
SS,  the  congestion  window  size  is  doubled  every  round-trip  time  until  one  or 
more  packets  are  marked.  If  a  mark  is  received,  then  the  congestion  window  size 
is  halved  and  TCP  switches  to  CA.  The  congestion  window  size  is  limited  by  the 
receiver  advertised  window  size  Wmax.  Hence,  if  the  connection  is  in  SS,  then  the 
congestion  window  of  connection  i  evolves  according  to 

W$s(t+1)  =  mm(2Wt{N\t)Vl,Wma^M^N\t+l) 

+  rEAW1(i_MW(<  +  i))  _  (5.6) 

where  M^N\t  +  1)  is  an  indicator  function  of  the  event  that  no  packet  of 
connection  i  has  been  marked  in  the  round-trip  preceding  timeslot  [t,t  +  1),  i.e., 
A4-N\t  +  1)  =  1  when  no  packet  from  session  i  is  marked  and  M^N\t  +  1)  =  0 
when  at  least  one  packet  is  marked.  The  marking  mechanism  is  explained  in  the 
next  subsection. 

In  CA,  the  congestion  window  size  in  the  next  timeslot  [t  +  1,  t  +  2)  is 
increased  by  one  if  no  mark  is  received  in  timeslot  [t,t  +  1),  while  if  one  or  more 
packets  are  marked  in  timeslot  [t,t  +  1),  the  congestion  window  in  the  next 
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timeslot  is  reduced  by  half.  The  congestion  window  size  in  CA  is  then  given  by 


W$A(t  +  1)  =  min  +  1,  W»„)  Mf- '  (t  +  1) 

+(-lb^W-|  (x  _  +  i))  .  (5.7) 

Both  W^s(t  +  1)  and  W^A(t  +  1)  are  candidates  for  the  congestion  window 
size  in  timeslot  [t  +  l,t  +  2)  depending  on  the  state  that  the  connection  i  is  in. 

Since  we  only  update  the  congestion  window  at  the  end  of  the  round-trip,  we 
use  the  mapping  (5.2)  to  retain  the  value  of  the  congestion  window  until  the  end 
of  the  round-trip  where  it  is  updated.  If  connection  i  is  in  SS  in  timeslot 
[t,  t  +  1),  its  potential  window  W-^s{t  +  1)  in  timeslot  [t  +  1,  t  +  2)  is  given  by 

H®<  +  1)  =  GM+1[w'iw((),W'S(i  +  l)],  (5.8) 

where  W^g(t  +  1)  is  given  in  (5.6).  Similarly,  if  connection  %  is  in  CA  in  timeslot 

[t,  t  +  1),  its  potential  window  W^\(t  +  1)  in  timeslot  [t  +  1,  t  +  2)  is  given  by 

W$A(t  +  1)  =  Ght+1[wi{N\t),Wt%(t+  1)],  (5.9) 

where  W^A(t  +  1)  is  given  in  (5.7). 

Let  the  {0,  l}-valned  rvs  {SlN\t),i  e  A/”}  encode  the  state  of  TCP 
connections,  with  the  interpretation  that  S^N\t)  =  0  (resp.  S^N\t )  =  1)  if 
connection  i  is  in  CA  (resp.  in  SS)  at  the  beginning  of  timeslot  [t,  t  +  1). 
Therefore,  combining  (5.8)  and  (5.9),  we  see  that  the  complete  recursion  of  the 
congestion  window  size  can  be  written  as 

Wt(N)  (t  +  1)  =  1  [xi(iV)  (t)  -  a\n)  (t)  >  0 

X  (sf b)irS(t  + 1)  +  (1  -  Sf°(t)) w^A(t  + 1))  .(5.10) 
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The  first  indicator  function  in  (5.10)  is  used  to  reset  the  congestion  window  size 
to  zero  when  session  i  runs  out  of  data  to  transmit  and  returns  to  its  idle  state. 
Finally,  the  evolution  of  {SlN\t),  t  —  0, 1, . . .}  is  given  by 


sf°(t+  1)  =  1  X^’(t)  <  W^\t) 


AO, 


AN), 


+1 


xl*0®  >  wnt)  sr’mr’it + 1).  (s.n) 


r(N), 


AN) 


(AO, 


This  equation  can  be  interpreted  as  follows.  Connection  i  is  in  SS  in  timeslot 
[t  +  l,t  +  2)  if  either  (1)  there  is  no  packet  left  to  transmit  (so  the  connection 
resets)  at  the  beginning  of  the  timeslot  or  (2)  the  connection  was  active  and  in 
SS  in  timeslot  [t,t  +  1)  and  received  no  mark  in  the  timeslot.  Equation  (5.11) 
assumes  that  a  new  TCP  connection  in  SS  is  ready  to  be  set  up  one  timeslot 
after  the  previous  connection  is  torn  down  upon  finishing  its  workload,  and  the 
new  TCP  connection  becomes  active  when  a  new  file/object  arrives  initiating 
three-way  handshake.  We  also  assume  that  the  slow  start /congestion  avoidance 
state  is  updated  in  the  next  timeslot  following  transmission.  However,  the 
window  size  is  updated  one  RTT  after  transmission  using  the  appropriate  SS/CA 
state  as  in  the  correct  operation  of  TCP. 


Network  dynamics 

The  model  of  the  network  dynamics  is  identical  to  the  model  presented  earlier  in 
Chapter  4  with  two  exceptions.  First,  we  also  introduce  a  queue  averaging 
mechanism  to  further  generalize  the  model.  The  marking  probability  will  be  a 
function  of  the  average  queue  size  as  is  implemented  in  RED  instead  of  the 
instantaneous  queue  size.  Second,  the  indicator  function  M-N\t  +  1)  needs  to  be 
modified  in  order  to  represent  the  event  that  no  marks  have  been  received  in  the 
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previous  round-trip  (as  opposed  to  the  previous  timeslot  as  in  Chapter  4).  The 
remaining  dynamics  are  the  same  as  will  be  briefly  summarized  here. 


The  recursion  of  the  queue  is  written  in  the  a  similar  to  Lindley’s  recursion 
introduced  in  (1.9).  More  specifically,  it  can  be  rewritten  as 

Q{N\t+  1) 

N 

=  QtN\t)  -NC  +  Y.  min  (wCb).  W’W)  1 

i= 1 

where  A^N\t)  =  YliLi  (t)  and  A[N\t)  is  given  as  (5.5)  in  this  model. 

The  queue  averaging  mechanism  utilizes  an  exponentially  weighted  moving 
average  (EWMA)  filter  to  average  the  instantaneous  queue  size  with  the  time 
constant  depending  on  the  parameter  0  <  a  <  1.  Let  Q^N\t  +  1)  represent  the 
EWMA  queue  in  the  beginning  of  timeslot  [t  +  1,  t  +  2),  then 

Q{n) (t  +  1)  =  (1  -  a)Q™ (£)  +  aQ(N) (t  +  i).  (5.12) 


/}jN>(t)  >  d; r'(t) 


(N)l 


A  special  case  when  a  =  1  is  equivalent  to  using  the  instantaneous  queue  size  for 
marking  mechanism  as  in  Chapter  4. 

We  represent  the  marking  through  the  {0,  l}-valued  rvs  M-^\t  +  1) 

(j  =  1, ...,  A\N\t))  with  M-j\t  +  1)  =  0  (resp.  M-j\t  +  1)  =  1)  if  the  jth  packet 
from  source  %  is  marked  (resp.  not  marked)  in  the  RED  buffer.  More  concretely, 
for  each  i  in  J\f  and  j  =  1,  2, . . .,  we  write 


M<f  (t  +  1)  =  1  V,j(t  +  1)  > 


(5.13) 


where  again  the  collection  of  i.i.d.  [0, 1  ]-uniform  rvs 

{Vij(t  +  1),  Vi{t  +  1),  i,j  —  1,  •  •  •  ;  t  —  0, 1,>  •  •  }  are  assumed  independent  of  all 
other  rvs  introduced  so  far. 
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The  indicator  function  of  the  event  that  no  packets  from  connection  i  in 


timeslot  [t,  t  +  1)  are  marked  can  then  be  written  as 

nd“>(,l<N)(*+i),  i 


M 


(TV) 


(t  +  1)  — 


1, 


AwW  =  o. 


(5.14) 


This  information  will  be  available  to  the  TCP  sender  in  the  next  timeslot. 
However,  this  information  is  used  only  one  RTT  later  to  update  the  congestion 
window  size,  and  M-N\t  +  1)  evolves  according  to 


Mf'(i  +  1)  =  Gy(Mf  >((),  M<H(i  +  1)),  (5.15) 

Notice  that  we  replace  t  +  1  in  the  mapping  G  in  (5.2)  by  t  to  delay  the  update 
in  the  value  of  M^N\t  +  1)  in  order  for  (5.8)  and  (5.9)  to  evolve  based  on  the 
markings  in  the  previous  round-trip.  For  example,  if  w[N\t)  is  updated  in 
timeslot  [t,  t  +  1)  then  M-N\t  +  1)  is  updated  in  timeslot  [t  +  1,  t  +  2)  and  its 
value  will  be  used  in  the  timeslot  [t  +  D[N\t),t  +  D[N\t)  +  1)  to  determine  the 
new  congestion  window  size. 


5.2  The  Asymptotics 

The  first  main  result  of  the  chapter  consists  of  the  asymptotics  for  the  normalized 
buffer  content  as  the  number  of  sessions  becomes  large  similar  to  Theorem  2. 

This  result  is  again  discussed  under  the  following  Assumptions  (AWl),(AW2b) 
where  the  structural  condition  (AW1)  is  identical  to  that  in  Chapter  4  while 
(AW2b)  is  modified  from  (AW2)  to  incorporate  additional  initial  conditions,  i.e., 

(AW2b)  For  each  N  =  1,2,...  and  i  =  1, ...  ,N,  the  initial  conditions  of  rvs  in  the 
model  are  given  by 

q(n)(o)  =  q(n)(  o)  =  iy(7V)(  o)  =  /f0(0)  =  njN)(  o)  =  o, 
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and 


Sf°(0)  =  M}N)(  0)  =  1. 


We  denote  the  vector  of  state  variables  for  session  i  in  timeslot  [t,  t  +  1)  by 

Y,W(*)  :=  (tvlN,(t),x}K>(t),S<K>(t),DjK>(t),fllN>(t),M}N>(t)).  (5.16) 


The  rv  Y-  (t)  takes  a  value  in  the  discrete  set 


y  ■=  {0,  x  {0,1,...,  Xmax}  x  {0,1}  x  {0,2,3,  ...,Dmax} 


x{0, 1, . . . ,  Dmax}  x  {0,1}. 


(5.17) 


Theorem  4.  Assume  that  (AW1 )  and  (AW2b)  hold.  Then ,  for  each  N  =  1,  2, . . . 
and  t  —  0, 1, . . .,  there  exists  a  (non-random)  constant  q(t)  and  a  y-valued  random 


vector 


Y(t)  = 


such  that  the  following  holds: 


(i )  The  following  convergences  take  places: 


- 77 -  *iV  q[t),  - 77 - >N  '/ 1 1 1 


(5.18) 


A  ’(*)  =4,Y(f). 


(5.19) 


(ii)  For  any  bounded  function  g  :  — >  M,  we  have 


(t))  -7>jvE  [g  (Y(t))] 


(5.20) 
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(Hi)  For  any  integer  I  =  1,  2, ,  the  random  vector  {Y jN\t),i  =  1 be¬ 
comes  asymptotically  independent  as  N  becomes  large,  with 

i 

lim  P[Y f\t)  =  yui  =  1,  ■■■,/]  =  I]  P  [YW  =  y<]  (5-21) 

N—*oo  ■*- 

i=  1 

for  any  y,t  G  y,  i  =  1, .  . . ,  I. 

In  addition,  with  initial  conditions  q( 0)  —  W( 0)  =  X(0)  =  D( 0)  =  /3(0)  =  0, 
and  5(0)  =  M( 0)  =  1,  it  holds  that 

q(t  +  l)  =  (q(t)-C  +  E[A(t)])+  (5-22) 

and 

q(t  +  1)  =  (1  -  a)q(t)  +  aq(t  +  1)  (5.23) 

where 

A(t)  =  min  (W(i),  X(t))  1  [/3(t)  >  D(t)\ . 

Further,  the  recurrence 

Y(t  +  1)  =  (W(t  +  l),X(t+l),S(t  +  l),D{t+l),P(t  +  l),M(t  +  l)) 

=,t  P(Y(t)) 

■■=  (Pi(Y(t)),P2(Y(t)),...,Pe(Y(t)))  (5.24) 
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holds  in  law,  where  P\(Y(t)), . . . ,  Pq( Y ( t ))  are  given  by 


Pi(Y(t))  =  l[X{t)-A(t)>0](S{t)Wss(t  +  l)  +  (l-S{t))WCA(t  +  l)); 
P2(Y(t))  =  1  [X(t)  >  0]  (X(t)  —  A(t))  +  1  [X(t)  =  0]  1  [f/(t  +  1)  <  Par\  F(t  +  1), 
P3(Y (t))  =  1  [X(t)  -  A(t)  <  0]  +  1  [X{t)  -  A(t)  >  0]  S{t)M(t  +  1), 

P4(Y(t))  =  D{t)l[X{t)>0}+l{X{t)  =  0}l{U(t  +  l)<Par\D{t  +  l), 

P,(Y(t))  =  (1  +  /3(t)l  [0(t)<  D(t)])  1  [X (t)  >  0] , 
and 

Pe(Y(t))  =  Gt(M(t),l[V(t+l)<(l-mt)))A<‘>]), 
where 

M(t  + 1)  :=  P6(Y(t)), 

Gt(a,  b )  =  al  \fd{t)  <  D(t )]  +  61  [/?(£)  >  D(t)] ,  a,  b  E  R, 

Wss(t  +  1)  =  Gt+1  (w(t),mm(2W(t)Wl,Wmax)M(t+l)+\]^](l-M(t  +  l))Sj  , 
and 

WCA(t  +  1)  =  Gt+i  (t),  min  (W (t)  +  1,  Wmax)  M(t  +  1)  +  (1  —  M(t  +  1))^  , 

for  i.i.d.  [0, 1  }-uniform  rvs  {U(t  +  1),  V(t  +  1);  t  —  0, 1, . . .}. 


The  proof  of  Theorem  4  is  presented  in  Chapter  6. 


5.3  Limiting  Cases 

Next,  we  briefly  consider  the  resulting  model  from  Theorem  4  in  the  regime  when 
C  is  either  very  large  or  very  small  under  the  following  assumption: 

(AW3)  The  marking  function  /  :  R  — >  [0, 1]  is  monotonically  increasing  with  /( 0)  = 
0  and  linxE_>0O  f(x)  =  1; 
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It  is  easy  to  see  that  limc_>0c  q(t)  =  0  for  all  t  —  0, 1, . . so  that  the  marking 
probability  per  flow  also  converges  to  zero  from  (AW3)  for  all  t.  Therefore,  each 
incoming  flow  will  always  operate  in  the  slow  start  (exponential  growth)  mode 
and  the  resulting  input  traffic  into  the  network  is  the  superposition  of 
(discrete-time)  Poisson  arrival  streams  of  random  number  of  packets,  each  of 
which  doubles  its  window  size  every  round-trip.  The  aggregate  input  traffic  is 
therefore  similar  to  the  time-reversed  shot-noise  processes,  which  is  compatible 
with  the  model  studied  in  [23]. 

On  the  other  hand,  with  C  ~  0,  the  queue  will  start  building  up,  whence 
lim^oo  q(t)  =  oo.  Thus,  for  large  t,  all  TCP  flows  (including  incoming  TCP 
flows)  will  experience  marking  probability  close  to  one  under  Assumption  (AW3). 
All  active  connections  will  be  able  to  inject  only  one  packet  per  round-trip  into 
the  network  because  every  packet  transmitted  will  be  marked  with  a  probability 
going  to  one.  As  a  result,  each  TCP  congestion  window  size  approaches  one  with 
high  probability.  Since  the  bottleneck  router  will  transmit  packets 
non-selectively,  any  active  flow  will  receive  roughly  equal  throughput  and  hence 
the  queue  behavior  approaches  that  of  processor-sharing,  assuming  identical 
RTTs  among  all  flows,  which  is  in  agreement  with  [32], 

5.4  Steady-State  Regime 

Using  the  results  from  the  previous  sections,  we  now  carry  out  a  simple 
steady-state  analysis  with  a  few  additional  reasonable  assumptions.  Although  we 
have  not  formally  proved  that  the  system  converges  to  steady  state  as 
complications  arise  due  the  deterministic  (i.e.,  degenerate)  character  of  the  first 


two  components  in  the  time-homogeneous  Markov  chain 

{(q(t),  q(t),  Y (t)),  t  —  0, 1, . . simulation  results  indicate  that  the  system  does 
converge  to  steady  state  for  a  large  range  of  initial  conditions  and  parameters. 
This  is  illustrated  in  Section  5.6.  In  order  to  facilitate  our  further  analysis  we 
assume  the  following: 

(AW4b)  The  sequence  {(g(f),  Y(f)),  t  —  0, 1);. . .}  admits  a  steady  state  in  the  sense 
that  (q(t),Y(t))  =>t  ( q*,Y *)  for  some  rvs  (q*,Y*)  where  q*  is  a  constant 
and  Y*  =  (W*,  X *,  S*,  D\  (3*,  M*)  is  a  ^-valued  rv. 

(AW5)  Let  Far  be  a  rv  with  the  distribution  F ,  representing  the  initial  workload  size 
of  a  new  connection,  we  assume  E  [W*  ]  «  E[Far]. 

(AW6)  We  assume  that  when  an  active  connection  finishes  its  last  transmission,  it 
waits  an  additional  RTT  before  resetting  its  window  size  to  zero.  5 

We  first  introduce  a  simple  result  which  will  help  in  the  analysis.  Its  proof  is 
given  in  Section  5.7. 

Lemma  1.  Assuming  (AWl),(AW2b),(AW3),  (AW4b),(AW5)-(AW6),  the  rvs 
min(W*,  X*)  and  1  \/3*  >  D*]  are  conditionally  independent  given  that  the  con¬ 
nection  is  active. 

It  is  easily  seen  that  Assumption  (AW4b)  immediately  implies  q(t)  — q *  =  q * 
for  some  constant  q*.  And  so  the  steady-state  marking  probability  is 
f(q*)  =  f(q *).  We  wish  to  find  the  steady-state  queue  level  q *  as  a  fixed-point 
5This  will  have  only  a  marginal  effect  under  Assumption  (AW5). 
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solution  to 


q*  =  (q*  —  C  +  E  [A*])  + 

=  (q*  ~  C  +  E  [min(VF*,  X*)l  [/?*  >  £>*]])+.  (5.25) 


Since  the  window  size  and  the  workload  are  both  zero  when  a  session  is  idle, 
we  have 


E[A*]  =  P  [active]  E  [A*  |  active] 

=  P  [active]  E  [min(kF*,  X*) (active]  P  \j3*  >  Inactive]  (5.26) 


where  the  last  equality  follows  from  Lemma  1.  The  probability  P  [active]  that  a 
session  is  active  in  steady  state  is  given  by 

p  E  [connection  duration] 

E  [connection  duration]  +  E  [idle  period] 

by  elementary  arguments  from  renewal  theory. 

First,  we  can  rewrite 

P  \/3*  >  Tractive]  =  J2d.,  P  [P*  >  tractive,  D*  =  dt]  P  [D*  =  rfj| active]. 
Conditioning  on  the  event  that  D*  =  d,  it  is  easy  to  see  that 


P  [D* 


di  | active]  ~ 


djP  [D  =  di 

'Yhdj&v  ^j'P  [P*  =  dj. 


The  expression  is  not  exact  because  according  to  our  model,  the  connection 
resets  its  window  to  zero  one  timeslot  after  transmitting  its  last  packet.  However, 
this  will  have  only  a  marginal  effect  under  (AW5). 

From  Assumption  (AW5)  since  a  connection  typically  lasts  many  RTTs,  we 


P  \fi*  >  dj  |  active,  D *  =  dt 


have 


1 

di 


(5.27) 


Therefore, 


P  \/3*  >  D*  |  active]  ~  ^ 


P  [D  =  dj\ 
'Yhdj&v  °^'P  [D  =  dj 


1 

E W 


(5.28) 


Let  Far  be  the  initial  workload  and  D  be  the  RTT  of  a  connection.  The 
conditional  expected  value  of  the  connection  duration  given  the  workload  size 
and  round-trip  delay  is 


E  [connection  duration  |  Far  =  x,  D  =  d] 


x 

T(d,f(q *))  ’ 


where  T(d,  f{q*))  is  the  mean  throughput  of  a  TCP  connection  with  RTT  of  d 
and  packet  marking  probability  of  f(q*).  Here  we  approximate  the  average 
throughput  of  a  TCP  connection  by  the  well-known  throughput  formula 


T(d,f(q *)) 


K 


d\/  f  (q*j  ’ 

where  Jl  is  some  constant  in  the  interval  [1,  |],  according  to  [48,  40]. 
This  implies  that 

'dy/f(q*) 


E  [connection  duration |Far  =  x,  D  =  d\ 


xc 


K 


Since  the  initial  workload  and  the  RTT  are  assumed  independent,  we  have 
E  [connection  duration]  «  ETar]E [D]y/f(q  ) .  rj']:ieref()re^ 


P  [active]  « 


E  [Far]  E  [D\  \/rf(q*) 
E[Far]E[D}^f{^)+K/Par 


(5.29) 


Finally,  in  order  to  compute  (5.26),  we  need  to  calculate 
E  [min(W*, X*)| active],  which  can  be  approximated  by  E  [IF* [active]  under 


(AW5),  i.e.,  E  [F]  »  E  [IE*]. 


E  [VE* | active]  =  E  [W*\D*  =  (inactive]  P  [D*  =  (inactive] 

di£V 

x  ^  K  d,P  [D  =  dj\ 

££ 0  VJW)  T,djev  djP  iD  =  di\ 

K 

Vfiv*)' 

Combining  (5.26)-(5.30),  we  get 


(5.30) 


E  [A*]  « 


_ KE  [Far] _ 

E  [Far]  E  [D]  +  Kj  Par ' 


If  0  <  f(q*)  <  1,  it  is  necessary  that  C  —  E  [A*].  After  some  simple  algebras, 
we  can  show 


/(?*) 


K2 

E  [D]2 


Par  E  [Fa 


(5.31) 


If  /  is  invertible,  then 

q  ~  f  1  (e  [D]2  (c  ~  Pa,-E[Far])  )  •  (5‘32' 

Note  that  the  steady-state  marking  probability  and  the  average  queue  size 
depend  on  the  round-trip  delay  and  incoming  workload  distributions  only 
through  their  mean  values.  Numerical  examples  validating  the  analysis  are  given 
in  Section  5.6.  This  simple  formulation  can  be  used  as  a  guideline  on  how  to 
design  the  feedback  probability  function  to  control  the  queue  size  at  the 
steady-state,  given  the  system  parameters.  However,  the  variance  of  the 
round-trip  delay  will  play  a  role  in  the  magnitude  of  the  queue  fluctuations  even 
though  the  mean  queue  size  is  determined  only  through  the  average  delay.  This 
will  be  shown  in  the  Central  Limit  analysis  in  the  next  section. 
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5.5  A  Central  Limit  Theorem 


In  this  section  we  present  a  CLT  complement  similar  to  Theorem  3  to  the  LLN 
results  in  Theorem  4.  The  description  of  the  model  and  the  notations  are  given 
in  Section  5.1.  The  analysis  is  carried  out  under  the  same  model  and  we  again 
need  to  strengthen  Assumption  (AW1)  to  (AWlb)  as  given  in  Section  4.4. 
However,  we  also  introduce  Assumption  ( AT) : 

(AW7)  The  workload  of  a  new  TCP  connection  is  bounded,  i.e.,  there  exists  an 
integer  Xmax  such  that  F^N\t)  is  a  {1,  •  •  •  ,  Xmax  [-valued  rv  for  all  i  in  Af  and  all 

t  =  0,1,....  6 

Fix  t  —  0, 1, . . ..  Again,  the  asymptotic  residual  capacity  given  in  (4.27)  plays 
a  crucial  role  in  the  analysis.  In  this  model,  it  is  given  by 

K{t)  =  C  -  q(t)  -  E  [min(W(f),  X(t))l  [/ 8(t )  >  D(t)]]  . 

Now  define  a  collection  of  rvs  that  is  integral  to  the  analysis.  For  each 
N  =  1,  2,  3, . . .  and  y  =  (w,  x,  s,  d,  b,  m)  €  y ,  let  L^\t)  be  as  given  in  (4.23).  We 
also  introduce  the  following  notations: 

K\t)  ■■=  AT-'M’  <5-33> 

and 

4N)(()  :=  1  EihV’m  Iyl  “  Py(‘>  |y| '  (5-34) 

Theorem  5.  Assume  (AWlb),(AW2b)  and  ( AW7 )  hold.  Then,  for  each  t  = 
0,1,...,  there  exists  an  M.\y\+2 -valued  rv  L(t)  =  (L0(f),  L0(t),Ly(t),  y  G  T)  such 

6The  limit  Xmax  can  be  lifted.  This,  however,  results  in  a  more  complicated  analysis.  Never¬ 
theless,  such  a  restriction  is  necessary  for  the  numerical  calculation  of  the  CLT  on  a  computer. 
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that  the  convergence 


s/N  (LW(t),LW(t),LW(t),  yey 


L(<) 


(5.35) 


holds.  Moreover,  the  distributional  recurrences 


L0(t  +  1)  —  st  \ 


L0{t)  +  L(t) 
{Lo(t)  +  L(t )) 


K{t)  >  0 
K(t)  <  0 
K(t)  =  0 


(5.36) 


and 


L0(t  +  1)  —  st  (1  —  a)L0(t)  +  aLo(t  +  1) 


(5.37) 


hold,  where 

m  =  E  min(it;,x)l  [b  >  d]  Ly(t). 
y  ey 

The  distribution  of  the  rv  Ly{t ),  y  G  y,  t  —  0, 1, . . .,  can  be  calculated  recur¬ 
sively  starting  with  t  =  0. 

Finally,  for  any  t  =  1,2,.. the  rv  Lo(t  +  1)  is  Gaussian1  if  K(s)  ^  0  for  all 
s  <  t. 


A  proof  of  Theorem  5  and  the  complete  distributional  recursions  are  provided 
in  [57], 

From  the  last  statement  of  Theorem  5,  a  necessary  condition  for  Lo(t)  not  to 
be  Gaussian  is  K (s)  =  0  for  some  s  <  t.  This  is,  however,  a  technical  artifact  of 
the  limiting  regime  as  in  practice  it  is  unlikely  that  the  real-valued  residual 
capacity  would  attain  the  value  zero  exactly.  Therefore,  in  practice  the 
distribution  of  the  RED  buffer  at  any  fixed  time  t  can  be  well  approximated  by  a 
Gaussian  rv. 

7Here  we  interpret  a  constant  as  a  Gaussian  rv  with  standard  deviation  equals  to  zero. 
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Discussion 


The  CLT  analysis  reveals  the  sources  of  fluctuation  in  the  queue  size.  It  is  shown 
in  the  proof  of  Theorem  5  that  the  queue  fluctuation  L0(t  +  1)  consists  of  four 
components  : 

(i)  Fluctuation  caused  by  the  discrepancy  between  the  feedback  information 
from  RED  to  TCP  sources  f^(Q^N\t))  and  the  limiting  feedback  infor¬ 
mation  f(q(t)):  This  uncertainty  in  feedback  information  can  be  explained 
by  the  Delta  Method  (Proposition  1)  and  is  already  discussed  in  details  in 
Section  4.4. 

(ii)  Binary  nature  of  feedback  information:  The  RED  gateway  either  marks  a 
packet  or  does  not  in  a  RTT.  This  binary  nature  of  feedback  information 
imposes  a  limited  feedback  information  granularity,  and  causes  a  fluctuation 
in  queue  size.  This  fluctuation  can  be  well  approximated  by  a  Gaussian  rv. 

(iii)  Fluctuation  caused  by  the  arrival  of  new  TCP  connections  and  the  random 
idle  periods:  The  larger  the  file  size,  i.e.,  workload  of  a  new  TCP  connection, 
and  waiting  time  variances,  the  larger  the  magnitude  of  this  fluctuation  is. 
This  part  of  the  fluctuation  can  also  be  described  by  a  Gaussian  rv. 

(iv)  Fluctuation  caused  by  the  structure  of  protocols:  The  structure  of  the  pro¬ 
tocols  determines  the  mappings  which  combined  the  rvs  discussed  in  (i)-(iii). 
The  resulting  rv  represents  the  overall  fluctuation  observed  at  the  queue. 

Components  (ii)  and  (iv)  are  due  to  the  protocols  and  cannot  be  mitigated 
without  modifying  the  protocols.  Component  (iii)  depends  on  users  behavior,  and 
hence  is  beyond  the  control  of  the  network.  Thus,  network  designers  can  only 


93 


manipulate  the  shape  (or  slope)  of  the  feedback  function  to  reduce  oscillation  of 
queue  size.  Although  reducing  the  slope  of  the  queue  can  decrease  the  magnitude 
of  fluctuation,  it  also  increases  the  average  queue  size  as  suggested  by  (5.32). 

The  aforementioned  trade-off  can  be  explored  in  the  context  of  an 
optimization  problem.  Suppose  that  we  have  the  convergences  q(t)  — q  <?*  and 
L0{t)  =>t  Lq.8  Then,  the  steady-state  queue  distribution  can  be  approximated  by 
Nq*  +  VNL*.  For  best-effort  traffic,  the  marking  function  /  should,  for  example, 
maximize  P  0  <  Nq*  +  \/NLq  <  NB  ,  assuming  that  the  buffer  size  is  NB. 
Clearly,  a  more  sophisticated  performance  metric  can  be  adopted,  which  depends 
on  the  probabilities  of  an  empty  queue  and  of  buffer  overflow.  The  availability  of 
the  queue  distribution  allows  us  to  formulate  the  corresponding  optimization 
problem  and  to  propose  a  systematic  solution. 

Furthermore,  if  the  protocol  suite  {i.e.,  TCP  and  ECN/RED)  can  be 
modified,  then  we  can  further  reduce  the  magnitude  of  queue  fluctuation  caused 
by  limited  feedback  granularity,  i.e.,  component  (ii).  One  simple  scheme  to 
improve  the  feedback  information  granularity  is  to  increase  the  number  of 
feedback  information  bits  in  the  ECN  mechanism.  Given  that  the  improved 
feedback  information  is  properly  utilized,  the  magnitude  of  queue  fluctuation 
would  be  reduced.  Multi-level  ECN  (MECN)  [10]  is  an  example  of  such  a  scheme. 

8Note  that  both  q *  and  Lq  depend  on  /. 
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5.6  Simulations  of  the  Model  with  Session-level  Dynamics 


and  Variable  Round-trip  Delays 

In  the  previous  chapter,  we  have  presented  simulation  results  which  point  to  the 
convergence  of  the  normalized  queue  when  the  number  of  persistent  TCP  flows  is 
large.  In  this  section,  we  extend  the  simulations  to  incorporate  both  session-level 
dynamics  and  the  variable  round-trip  delays  of  connections  in  order  to 
demonstrate  that  a  similar  convergence  still  exists  and  also  reveals  the  roles  that 
hie  size  and  round-trip  have  on  the  convergence  of  the  queue  along  with 
validation  of  the  steady-state  asymptotic  queue  formula  in  (5.32). 

Numerical  examples 

This  section  presents  numerical  examples  to  study  the  behavior  of  the  queue  size 
per  how.  The  numerical  examples  are  evaluated  through  Monte-Carlo 
simulations  of  the  model  in  Section  5.1. 


Example  (i) 


The  system  and  control  parameters  are  set  as  follows:  C  —  1  packet/timeslot  and 


f{N\x)  = 


0,  x  <  2 N 

0.2^yf,  2N  <  x  <  20N 
1,  otherwise. 


The  initial  values  are  set  to  according  to  Assumption  (AW2b).  The  variables 
evolve  from  their  initial  values  according  to  the  dynamics  outlined  in  Section  5.1. 
The  workload  iq(f)  ~  geometric(p) ,  i  =  1,  2, . . . ,  N  and  t  —  0, 1, . . .  where 
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p  =  0.001,  i.e.,  E  [Fi(t)}  =  1,  000  packets.  The  idle  periods  of  sessions  are 
geometrically  distributed  with  a  mean  of  20  timeslots.  The  receiver  advertised 
window  size  TTmax  is  set  to  64.  The  exponential  averaging  parameter  a  is  set  to 
0.01. 

First,  we  simulate  the  dynamics  when  the  random  round-trip  delay  Dt  new(t) 
is  uniformly  distributed  on  the  set  V  —  {2, . . . ,  6}.  Figure  5.1  plots  the  evolution 
of  the  queue  size  per  flow  with  the  number  of  sessions  N  =  100,  500, 1000,  5000, 
and  10000.  As  expected,  the  oscillation  in  the  queue  size  per  flow  decreases  with 
increasing  N.  Given  the  parameters  used  in  this  example  with  K  =  y/3/2  (which 
is  shown  in  Section  4.3.2  to  be  a  reasonable  approximation),  we  have  from  (5.32) 
q *  =  f^1  (j  —  o  054000 )2)  =  10-1-  Hence,  as  can  be  seen  from  Figure  5.1,  the 

steady-state  queue  size  is  close  to  the  value  predicted  by  (5.32). 

In  the  next  example,  we  demonstrate  the  role  of  the  variance  of  the 
round-trip  delay.  In  this  case,  the  round-trip  delay  of  each  connection  is  either  2 
or  6  with  equal  probability,  i.e.,  Bernoulli  rv  with 

P  [ Dimwit )  =  2]  =  P  [Di)1iew{t)  =  6]  =  0.5,  t  =  0, 1, . . .,  which  has  the  largest 
variance  of  any  distribution  on  T>  with  the  mean  of  4.  The  rest  of  setup  is 
identical  to  the  previous  case.  Figure  5.2  plots  the  evolution  of  the  queue  size  per 
flow.  While  the  queue  size  converges  to  the  same  value  as  in  the  previous 
example,  notice  that  (i)  the  magnitude  of  the  fluctuation  for  the  same  number  of 
users  is  greater  when  the  round-trip  distribution  is  Bernoulli,  and  (ii)  the 
convergence  to  steady-state  is  slower  (in  time)  for  Bernoulli  distribution.  This 
clearly  demonstrates  that  while  the  steady-state  mean  queue  size  depends  only 
on  the  mean  of  the  round-trip  delay,  the  transient  behavior  and  the  magnitude  of 
fluctuation  are  affected  by  the  variance  of  round-trip  delay. 
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Figure  5.1:  Evolution  of  queue  size  per  flow  when  the  round-trip  is  uniform. 
Example  (ii) 

In  the  second  example,  we  change  the  capacity  per  flow,  workload  and  the  idle 
period  distribution  to  be  as  follows:  C  =  0.6  packet /timeslot,  and  workload 
m  geometric(p),i  —  1, . . . ,  N  and  t  —  0, 1,  •  •  ■ ,  where  p  =  0.005, 
i.e.,  E  [Ft(t)]  =  200  packets.  The  idle  periods  of  sessions  are  geometrically 
distributed  with  a  mean  of  5  timeslots.  The  rest  of  the  parameters  are  identical 
to  Example  (i). 

Again,  we  simulate  the  dynamics  when  the  random  round-trip  delay  Di  new(t ) 
is  uniformly  distributed  on  the  set  V  =  (2, . . . ,  10}  so  the  average  round-trip 
delay  equals  6  timeslots.  Figure  5.3  plots  the  evolution  of  the  queue  size  per  flow 
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Figure  5.2:  Evolution  of  queue  size  per  flow  when  the  round-trip  is  Bernoulli. 


with  the  number  of  sessions  N  =  100,  500, 1000,  5000,  and  10000.  Figure  5.4  plots 
the  evolution  when  Dinew(t )  has  the  distribution 

P  [Diinew(t)  =  2]  =  P  [Diinew(t)  =  10]  =  0.5,  i  =  1,  2, . . N  and  t  =  0, 1, . . .,  so 
that  E  [. Di:new(t )]  =  6. 

Given  the  parameters  used  in  this  example  again  with  K  =  a/3/2,  we  have 

2n 


q*  «  r1  \K2 


C E  [D]  Par  E  [Fa 


=  f 


-i  1 


2  V(0.6)(6)  0.2  •  200y 


=  10.63, 


which  is  close  to  the  steady-state  queue  level  in  both  simulations.  Therefore,  this 


20 


Figure  5.3:  Example  (ii)  :  Evolution  of  queue  size  per  flow  when  the  round-trip  is 
uniform. 

verifies  that  the  steady-state  average  queue  size  depends  only  on  the  mean 
round-trip  delay. 

The  numerical  examples  presented  here  support  the  conclusions  that  (i)  the 
oscillation  in  the  queue  size  per  flow  decreases  with  increasing  N,  (ii)  the  average 
queue  size  at  steady-state  depends  only  on  the  mean  RTT,  and  (iii)  the 
magnitude  of  the  queue  fluctuation  depends  on  the  distribution  of  the  RTT. 
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Figure  5.4:  Example  (ii)  :  Evolution  of  queue  size  per  flow  when  the  round-trip  is 
Bernoulli. 

NS- 2  simulation  results 

In  this  section  we  verify  our  analysis  by  a  more  realistic  event-driven  NS 
simulations.  In  the  simulation  we  gradually  vary  the  number  of  sessions  from  25 
to  1,000,  and  study  the  queue  dynamics.  The  system  parameters  used  in  the 
simulation  are  scaled  with  the  number  of  sessions  N  as  follows:  the  bottleneck 
link  capacity  =  0.24  •  N  Mbps,  the  bottleneck  buffer  =  25  •  N  packets. 
The  bottleneck  RED  gateway  with  ECN  option  is  configured  as  follows: 

/( N\x)  =  f(N~1x )  (he.,  a  scaling  similar  to  Assumption  (AW1))  with 
qlnin  —  2  ■  AT,  qmax  =  10  ■  N,  and  Pmax  =  0.1  with  the  gentle  mode  enabled.  The 
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receiver  advertised  window  Wmax  is  set  to  64  packets  and  the  packet  size  is  fixed 
to  1,000  bytes.  The  exponential  averaging  weight  of  the  RED  gateway  is  set  to 
0.02/JV  in  order  to  have  a  similar  time  constant  in  all  cases.  A  session  generates  a 
geometrically  distributed  workload  that  with  a  mean  of  100  packets,  and  the 
interarrival  times  of  the  new  workloads  for  each  session  are  exponentially 
distributed  with  a  mean  of  3.3  seconds.  When  a  session  runs  out  of  data  to 
transfer,  it  terminates  the  TCP  connection.  A  new  TCP  connection  is  initiated 
by  the  session  when  the  next  workload  arrives  for  the  session.  We  also  enable  the 
dropjront _  option,  i.e.,  the  RED  gateway  marks  the  packet  at  the  front  of  the 
queue  rather  than  the  packet  that  has  just  arrived,  in  order  to  reduce  the 
feedback  delay  of  the  marks  to  the  TCP  senders. 

The  simulation  results  are  obtained  with  two  types  of  round-trip  delay 
distributions.  First,  the  round-trip  propagation  delays  of  the  sessions  are 
randomly  selected  uniformly  from  [52, 121.5]  ms,  with  a  mean  of  87  ms.  The 
simulation  result  is  shown  in  Figure  5.5.  Next,  the  round-trip  propagation  delays 
of  the  sessions  are  randomly  selected  to  be  either  52  ms  or  121.5  ms  with  equal 
probability  (i.e.,  i.i.d.  Bernoulli  rv).  This  delay  distribution  also  has  the  mean  of 
87  ms  but  with  much  higher  variance.  Figure  5.6  shows  the  simulation  result 
from  this  setting. 

Notice  that  the  fluctuations  in  the  normalized  queue  level  decrease  as  the 
number  of  sessions  N  increases.  Furthermore,  observations  on  steady-state  queue 
level  and  fluctuations  are  all  in  agreement  with  the  conclusion  from  Section  5.6. 
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Figure  5.5:  Queue  dynamics  of  the  NS-2  simulation  where  the  round-trip  distribu¬ 
tion  is  uniform 

5.7  A  Proof  of  Lemma  1 

First,  the  marking  probability  is  fixed  by  (AW4b)  and  each  of  the  session  at  the 
steady-state  is  independent  from  each  other.  Recall  that  the  size  of  the  workload 
as  the  connection  is  initiated  and  the  round-trip  delay  are  independent.  We 
notice  that  conditioning  on  the  event  that  the  connection  is  active, 

P  [min(VF*,  A*)  =  w\D *  =  d,  active]  =  P  [min(hF*,  A"*)  =  inactive].  This  is 
because  the  distribution  of  rv  min(VF*,  A"*)  during  the  transmission  time  is 
independent  of  D*  (this  can  be  proven  by  induction  as  a  consequence  of  the  fixed 
marking  probability  (AW4b))  and  rv  min(W*,  A*)  in  between  the  transmission 
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Figure  5.6:  Queue  dynamics  of  the  NS-2  simulation  where  the  round-trip  distribu¬ 
tion  is  Bernoulli 

time  has  the  same  value  as  min(VF*,  X*)  at  the  latest  transmission  time. 

This  also  implies 

P  [D*  =  d\  min  (IF*,  X*)  =  w,  active]  =  P  [D*  =  tractive]  ,9 
Furthermore, 

P  \J3*  >  D*  |  min  (IF*,  X*)  =  w,  active,  D*  =  d]  =  P  [/?*  >  Tractive,  D *  =  d] 

because  once  the  connection  is  active  and  the  round-trip  time  is  d,  the 
probability  of  the  counter  j3*  greater  or  equal  than  d  does  not  depend  on 

9This  can  be  viewed  as  a  consequence  of  the  fact  that  the  initial  workload  rv  and  the  RTT  rv 
are  independent. 
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min  ( W* ,  X*) .  F inally, 


P  \J3*  >  D*\  X*)  =  w,  active] 

=  ^  P  [/F  >  D*|  min(IF*,  X*)  =  w,  active,  D*  =  d] 
dev 

•P  [D*  =  d\  min  (IF*,  X*)  =  w,  active] 

=  P  [/F  >  D*  (active,  D*  =  d]P  [D*  =  d\  min  (IF*,  X*)  =  w,  active] 
dev 

=  P  [P*  >  -D*|active,  D*  =  d]  P  [D*  =  d|active] 
dev 

=  P  \P*  >  D*  |  active] , 

and  the  desired  result  follows  directly  from  the  last  equality. 
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Chapter  6 


Weak  Laws  of  Large  Numbers 


In  this  chapter,  we  present  the  proofs  of  the  Weak  Laws  of  Large  Numbers  for  the 
window-based  models  in  Chapters  4  and  5,  i.e.,  Theorems  2  and  4.  The  proofs  of 
these  theorems  utilize  similar  ideas  revolving  around  induction  of  convergence 
statements  at  each  time  t  —  1,2,....  The  recursion  of  the  asymptotic  queue  size 
q{t )  is  established  as  a  byproduct  of  the  proofs  of  the  convergence  statements. 

Before  presenting  the  proofs  of  Theorems  2  and  4,  we  first  present  in  Section 
6.1  a  generic  model  of  congestion-controlled  flows  and  RED  and  then  state  a 
Weak  Law  of  Large  Numbers  for  this  model  (Theorem  6).  Next,  we  show  that 
the  convergence  statements  in  Theorems  2  and  4  are  mere  corollaries  of  Theorem 
6  in  Section  6.2  and  6.3,  respectively.  The  proof  of  Theorem  6  is  presented  in 
Section  6.4.  Finally,  Section  6.5  establishes  the  distributional  recursions  in 
Theorems  2  and  4  as  a  byproduct  of  the  proof  of  Theorem  6. 

6.1  A  Weak  Law  of  Large  Numbers  of  a  Generic  Model 

In  this  section,  we  develop  a  generic  model  of  congestion-controlled  flows  and  a 
RED  gateway.  A  Weak  Law  of  Large  Numbers  for  the  model  is  then  presented. 
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This  result  not  only  helps  facilitate  the  presentation  of  the  proofs  of  Theorems  2 
and  4,  but  also  yields  insights  on  the  crucial  elements  in  the  development  of  these 
limit  theorems. 

We  denote  the  vector  of  state  variables  for  session  i  in  timeslot  [t,  t  +  1)  by 
Y \N\t).  The  rv  Y \N\t)  takes  values  in  a  discrete  space  y.  Recall  the  recursion 
of  the  queue  is  given  by  (1.9),  where  A[N\t)  represents  the  amount  of  packets 
injected  into  the  network  in  timeslot  [t,t  +  1)  by  session  i  and  depends  on  the 
particular  model.  More  specifically,  the  rv  (t)  is  determined  as  a  function  of 
Y^(t),  i.e.,  A\N\t)  =  <f>(Y|iV\f)),  for  some  bounded  continuous  function 
<f>  :  y  — >  R.  Further,  the  recursion  of  the  queue  average  used  in  the  marking 
mechanism  is  given  by  (5.12).  A  special  case  when  a  —  1  is  equivalent  to  using 
the  instantaneous  queue  size  for  marking  mechanism,  e.g.,  the  model  in  Theorem 
2. 

The  marking  mechanisms  in  Chapter  4  and  5  are  similar  and  will  be  briefly 
summarized  here.  Each  incoming  packet  into  the  router  in  timeslot  [t,t  +  1)  is 
marked  with  a  probability  (^Q^N\t) j ,  depending  on  the  average  queue 
length  Q(N\t)  at  the  beginning  of  the  timeslot  [t,t  +  1).  We  represent  this 
possibility  by  the  {0,  l}-valued  rvs  M-j\t  +  1)  (j  =  1, ...,  A[N\t))  with  the 
interpretation  that  M-J\t  +  1)  =  0  (resp.  M-J\t  +  1)  =  1)  if  the  jth  packet 
from  source  i  is  marked  (resp.  not  marked)  in  the  RED  buffer. 

To  do  so  we  introduce  a  collection  of  i.i.d.  [0, 1] -uniform  rvs 
{Vij(t  +1),  i,  j  =  1,  •  •  •  ;  t  —  0, 1,  •  •  •  }.  Given  t  —  0, 1, . . .,  the  rvs 
{Vij(t  +1),  i,  j  —  1,  •  •  •  }  are  assumed  to  be  independent  of  all  the  events 
happening  prior  to  the  beginning  of  timeslot  [t  +  1,  t  +  2).  For  each  i  —  1, . . . ,  N 
and  j  —  1,2,. . .,  we  define  the  marking  rvs  M-j\t  +  1)  as  in  (5.13),  so  that  the 
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rv  M-‘J>  (t  +  1)  is  the  indicator  function  of  the  event  that  the  jth  packet  from 
source  i  is  not  marked  in  timeslot  [t,t  +  1).  The  indicator  function  of  the  event 
that  no  packets  from  connection  i  in  timeslot  [t,t  +  1)  are  marked  can  now  be 
written  as  M-^ew(t  +  1)  given  in  (5.14). 

With  Tt  =  cr  {<3^(0),  Yf°(0),  Vi:j(s),  1  <  s  <  t;  i,  j  —  1,  2, . . .  j,  define 

zf\t)  :=  E  +  l)\Ft 

/  \  A(N)(t ) 

=  ‘  u.  (6.1) 

In  other  words,  z\N\t )  is  the  conditional  probability  that,  given  connection  i 
will  receive  no  marks  during  timeslot  [t,t  +  1). 

Theorem  6  is  discussed  under  the  following  assumptions. 

(Gl)  There  exists  a  continuous  function  /  :  M+  — >  [0, 1]  such  that  for  each  N  = 
1,2,.. .,  Equation  (1.1)  holds. 

(G2)  For  each  N  —  1,2,...,  the  dynamics  (1.9)  and  (5.12)  start  with  the  conditions 

Q(iV)(0)  =  Q{N\t)  =  0, 

and 

Yf°(0)=y,  i  —  1, . . . ,  TV 

for  some  y  in  y. 

(G3)  For  any  bounded  mapping  g  :  y  — >  M,  there  exists  a  bounded  and  continuous 
mapping  Fg  :  [0, 1]  x  y  — >  M  such  that 

E  [9  (Y f>((  +  1))  |^]  =  F,  (z.W(t),  Yf>(())  .  (6,2) 
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Assumption  (Gl)  is  identical  to  Assumption  (AW1)  in  Theorems  2  and  4,  while 
Assumption  (G2)  generalizes  Assumption  (AW2)  and  (A2Wb)  in  Theorems  2 
and  4,  respectively.  It  implies  that  for  any  t  =  0, 1, . . .  the  collection  of  rvs 
Y [N\t), . . .  ,Y^\t)  are  exchangeable.  We  will  show  in  the  discussion  of 
Theorems  2  and  4  that  Assumption  (G3)  indeed  holds.  Assumption  (G3)  states 
that  given  the  events  leading  up  to  the  beginning  of  timeslot  [t,t  +  1),  the 
expected  behavior  of  session  i  leading  to  the  beginning  of  timeslot  [t  +  1,  t  +  2) 
can  also  be  determined  using  knowledge  of  the  conditional  marking  probability 
zjN>  (t)  and  of  Y jN\t).  Note  that  (G3)  immediately  implies 


E 


9  (Y<">(«  +  1))]  =  E  [F,  (zf  >(«),  Yf »(«) 


(6,3) 


Theorem  6.  Assume  that  (G1)-(G3)  hold.  Then,  for  each  N  =  1,2,...  and 
t  =  0, 1, . . .,  there  exists  (non-random)  constants  q(t),  q(t)  and  a  y-valued  rv  such 
that  the  following  holds: 


(i )  The  following  convergences  take  place: 


and 


Q{N\t) 

N 

QW{t) 

N 


~~*N  q(t ) 
—*n  q(t) 


Y(t). 


(6.4) 

(6.5) 


(6.6) 


(ii )  For  any  bounded  function  g  :  y  — » ►  M, 

]tEfli9(Y.<A')(«))  (s  (Y(i))] .  (6,7) 
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(Hi)  For  any  integer  I  =  1,  2, ,  the  rvs  {Y jN\t),i  =  1 become  asymp¬ 
totically  independent  as  N  becomes  large,  with 

i 

lim  P[Yf>(t)  =  yf,  i  =  l, ■••,/]  =  TTpp-(*)=yJ  <6-8) 

N—>oo 

i= 1 

for  any  y,  G  y,  i  =  1, . . . ,  /. 

In  particular,  if  we  take  g  =  $  in  (6.7),  we  find 

£EHi4K)W  Av  E[#)]'  (6.9) 

From  Theorem  6,  we  only  need  to  show  that  the  models  in  Theorems  2  and  4 
both  satisfy  Assumption  (G3),  i.e.,  identify  the  mapping  Fg  in  (6.2),  and 
subsequently  establish  the  distributional  recursions  to  finish  the  proofs  of  these 
theorems.  The  proofs  of  Theorems  2  and  4  will  be  presented  following  this 
approach  in  Section  6.2  and  6.3,  respectively. 

6.2  Proof  of  Theorem  2 

According  to  the  model  in  Chapter  4,  the  state  variable  Y^  (t)  is  W '[N'>  (t)  and 
the  state  space  y  is  the  set  {1, . . . ,  lTmax } .  The  exponential  weighted  average 
parameter  is  a  =  1  and  the  average  queue  length  Q^N\t)  used  for  the  marking 
mechanism  coincides  with  the  instantaneous  queue  length  Q(N\t)  for  all 
t  —  0, 1, . . ..  The  number  of  packets  injected  into  the  network  A[N\t)  is  given  by 
W-N) (t) .  It  still  remains  to  show  that  Assumption  (G3)  in  Theorem  6  and  the 
distributional  recursion  (4.10)-(4.11)  hold. 

For  some  positive  integer  p  >  1,  consider  an  arbitrary  mapping  g  :  IN  — > 

With  g  we  associate  the  mappings  g*,  g*  :  IN  — »  given  by 

g*(w)  ■=  g( min  {w  +  1,  VFmax)),  w  G  IN  (6.10) 
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and 


9*(w)  :=  ^(min  Wmax)),  w  G  IN.  (6.11) 

Now  fix  i  =  1, . . . ,  N  and  t  —  0, 1, . . ..  It  follows  from  (4.5)  that 

9(W'f>(«  +  l))  (6.12) 

=  Mf  >(«  +  imwr\t))  +  (1  -  Mf  >(«  +  1)) g.(w}N\t)) 

Let  Tt  denote  the  n-held  generated  by  the  rvs 

{Q(JV)(0), W;(JV)(0), V^(s), ^-(s),  i,j  =  1,2,...;  s=l,...,t}. 

The  rvs  Q^N\t)  and  W-N\t),  i  —  1, . . . ,  N,  being  all  ^-measurable,  it  holds 


under  the  enforced  independence  assumptions  that 

E  [m<J°  (f  +  l)|js]  =  1  -  f{N\Q{N)m  j  =  1,  2, . . . 

so  that 

E  [iwf°(f  +  1)|JS]  =  z\N\t )  (6.13) 

by  conditional  independence,  where  we  have  set 

z\N\t)  =  (1  - /W(QW(f)))^W(t)  •  (6.14) 

It  is  now  plain  that 

Mf°(t  +  1)  1  [V#  +  1)  <  zf°(i)]  .  (6.15) 

It  readily  follows  from  (6.12)  that 

E  [g(WiN\t  +  l))|Jn]  =  F,(zf\t),  W,(N' '(«))  (6.16) 

where  the  mapping  :  [0, 1]  x  N  ->  is  associated  with  g  through 

Fg(z,w)  =  zg*(w)  +  (1  -  z)g*(w),  zG  [0,1],  w  e  IN.  (6.17) 
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Thus,  we  can  take  the  mapping  specified  by  (6.17)  to  be  the  mapping  Fg 
which  ensures  (6.2)  and  Assumption  (G3)  thus  holds.  The  convergence 
statements  in  Theorem  2  is  now  a  direct  corollary  of  Theorem  6.  It  remains  only 
to  establish  the  distributional  recursions  (4.9)-(4.11).  This  will  be  accomplished 
in  Section  6.5  as  it  is  a  byproduct  of  the  proof  of  Theorem  6. 

6.3  Proof  of  Theorem  4 

The  state  variable  Y \N\t)  in  the  model  of  Chapter  5  is  given  in  (5.16)  with  the 
state  space  y  given  by  (5.17).  In  the  model,  the  exponential  weighted  average 
parameter  lies  in  the  range  (0, 1]  and  the  average  queue  length  Q^N\t)  is  used  for 
the  marking  mechanism  instead  of  the  instantaneous  queue  length  Q(N^(t)  for  all 

t  —  0, 1, _  Finally,  we  have  A\N\t)  given  by  (5.5). 

Fix  i  =  1 , ,N  and  consider  an  arbitrary  bounded  mapping  g  :  — »  R: 
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Through  a  careful  case  analysis,  it  follows  that 


j(Yf  >(i  + 1)) 


(6.18) 


=  1 


+  1 


A'f  >(«)  =  o' 

Xl?  (0,  1  [Ui(t  +  1)  <  Par ]  Fi(t  +  1),  1,  1  [Ui(t  +  1)  <  Par]  Di,new{t  +  1),  0,  1) 


Xf\t)  >  Af\t)  >  0 
xsavf’W,  xf\t)  -  +  l)Sf  >(t),  £>f ’(<),  1,  NlZljt  +  1)) 

+  1  [o  <  x[N\t)  <  /if  >(*)]  11(0,0, 1, Of  f),  1, MSU*  +  X)) 

+  1  \xf- \t)  >  0,/jf  >( t)  =  Of  >(<)  -  1 


*g(wSl(t + 1),  xr ’(«),  sr'(t),Dr(t),  orw,  <"(t» 

A'ff)  >  0,0, ff)  <  Off)  -  1 

L 

xg(W,m(t),xW(t),sW(t),Df\t),rtN\t)  +  1 


qWm  nWm  nWi 


rW/ 


+  1 


where 

sWfiV  +  1),  Af  >((),  Sf  >(*),  I>f  f),  Off),  Mf  >(*)) 

=  Mffjsff)^1  (wffiJ.A, ffj.off)) 

+  Mf>w(i  -  sf  >(«))e,2  (wf°  w,  a ff),  of ’(*)) 

+  (l-Mff))Ffwff),Aff),Off)).  (0-19) 

The  mappings  Fj ,  Fg  and  F^  :  Z(j_  — >  R  are  associated  with  g  and  defined  by: 

Fg  (w,x,d)  =  g  (min  (2w  V  1,  Wmax) ,  x,  1,  d,  d,  1) , 

Fg  (w,  x,d)  =  g  (min  (w  +  1,  Wmax) ,  x,  0,  d,  d,  1) , 

Fg  (w,x,d)  =  g(j^],x,  0,  d,  d,  o)  .  (6.20) 

Let  Ft  denote  the  a- field  generated  by  the  rvs  {(2^(0),  Y^(0),  Fj(s),  Fj(s), 
Di,new(s),  bi(s),  Vij(s)  ,  i,  j  =  1,  2, , , .  ;  s  =  1, . . . ,  t  }.  So  the  rvs  Q(7V)(^)  and 
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Y jN\t)  (i  —  1 ,N)  are  ^-measurable.  Under  the  enforced  independence 
assumptions,  it  holds  that 

E  [M%\t  +  1)|^]  =  1  -  fW(Q{N\t)),  /../  1.2 . 

Therefore. 

M%\t  +  l)\Pt  =  zf\t)  (6.21) 

by  conditional  independence,  where  we  have  set 

z\N\t)  =  (l- f(N)(&N\t))}  '  .  (6.22) 

It  is  now  clear  that 

A(t) 

J]  M%\t  +  1)  =st  1  [U(t  +  1)  <  zf°(t)]  .  (6.23) 

i=  1 

Finally,  it  readily  follows  from  (6.18)  that 


e[9(Y,ot  («+l))K 

=  1  [xf  >(f)  =  o] 

XE  [g  (0,  1  [Ui(t  +  1)  <  Par]  Pi  (t  +  1),  1,  1  [Ui(t  +  1)  <  Par \  Ditnew(t  +  1),  0,  1)] 

+  1  [xf  >(t)  >  Af\t)  >  o' 

y\zf\i)g(w!N\i),  Xf\t)  -  AfHt),  Sf  >(«),  £>f  >(t),  1, 1) 

+(1  -  ziN}(t))g(W,m(t), xf  >(()  -  Xf  >(t),0,  £>f  >(f),  1, 0)] 

+  1  [o  <  X,m(t)  <  a\ ;N\t) 

x[zf  >(i)9(0,0, 1,  A,K)(«).  1. 1)  +  (1  -  Zf\t))g(  0,0, 1,  Df\t),  1,0)] 

+  1  [xf  >(«)  >  0,/3 iff)  =  D?\t)  -  l 

9(W^w(t  +  1),  Xf  >((),  Sf  >(*),  Df\t),  Bf  >(«),  Mf  >(«)) 

+  1  [xf >(()  >  0,/?f '(()  <  Off)  -  l' 
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FAZiMHt),wjNHt),xW(t)iSP(t),Df\t),tiNHt),MW(t))  (6,24) 


where  the  mapping  Fg  :  [0, 1]  x  Z+  x  Z+  x  {0, 1}  x  Z+  x  Z+  x  {0, 1}  — >  M  is 
associated  with  g  through 

Fg(z,w,x,s,d,b,m )  (6.25) 

=  1  [x  =  0] 

XE  [g  (0,  1  [Ui(t  +  1)  <  Par]  Pi  (t  +  1),  1,  1  [Ui(t  +  1)  <  Par ]  Di,new(t  +  1),  0,  1)] 
+  1  [x  >  min  ( w ,  x)  1  [b  >  d  >  0]] 

x  [zg(w,  x  —  min  ( w ,  x)  1  [6  >  d  >  0] ,  s,  d,  1, 1) 

+  (1  —  z)g{w ,  x  —  min  (w,  x)  1  [b  >  d  >  0] ,  0,  d,  1,  0)] 

+  1  [0  <  x  <  min  ( w ,  x)  1  [b  >  d  >  0]]  [^^(0,  0, 1,  d,  1, 1)  +  (1  —  z)g( 0,  0, 1,  d,  1,  0)] 
+  1  [x  >  0,  b  =  d  —  1]  gnew{u h  x,  s,  d,  m) 

+  1  [x  >  0,  b  <  d  —  1]  g(w,  x,  s,d,b  +  1,  m), 

with 

9new(w,  xi  s j  d)  =  msFy  ( w ,  x ,  d)  +  m(l  —  s)F^  ( w ,  x,  d)  +  (1  —  rn)Fg(w,  x,  d). 

We  note  that 

E  [g  (0,  1  [Ui(t  +  1)  <  Par]  Fi(t  +  1),  1,  1  [Ui(t  +  1)  <  Par]  Di:new(t  +  1),  0,  1)] 

always  exists  and  is  finite  by  the  boundedness  of  the  mapping  g.  Furthermore, 
the  mapping  Fg  is  continuous  with  respect  to  the  product  topology  on  the  set  W 
Thus,  the  mapping  Fg  specified  by  (6.25)  ensures  (6.2)  and  Assumption  (G3) 
thus  holds.  The  convergence  statements  in  Theorem  4  is  again  a  direct  corollary 
of  Theorem  6.  The  distributional  recursions  for  Theorem  4  will  be  established  in 
Section  6.5. 
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6.4  A  Proof  of  Theorem  6 


We  introduce  the  following  terminology  to  facilitate  the  discussion:  For  each 
t  —  0, 1, . . the  statements  [A:t],  [B:t],  [C:t]  and  [D:t]  below  refer  to  the 
following  convergence  statements: 

[A:t]  Equations  (6.4)  and  (6.5)  hold  for  some  non-random  constants  q{t)  and  q(t). 
[B:t]  Equation  (6.6)  holds  for  a  W  valued  rv  Y (t). 

[C:t]  For  any  integer  /  =  1,  2, . . .,  the  rv  i  =  1 become  asymp¬ 

totically  independent  with  large  N  as  described  by  (6.8),  where  Y (t)  are  the 
rvs  occurring  in 

[D:t]  For  any  bounded  mapping  g  :  y  — >•  M,  the  convergence  (6.7)  holds  with  Y (t) 
being  the  rv  occurring  in  [B:t], 

With  the  help  of  a  series  of  lemmas,  we  shall  prove  the  validity  of  the 
statements  [A:t]-[D:t]  for  all  t  —  0, 1, . .  ..  We  do  so  by  induction  on  t  and  in  the 
process  establish  Theorem  6. 

Lemma  2.  Under  (Gl)  and  (G3),  if  [A:t]  and  [B:t]  hold  for  some  t  =  0,1,..., 
then  [B:t+1]  holds. 

Proof.  Together  the  convergence  [A:t]  and  [B:t]  imply  [29,  Thm.  5.28,  p.  150] 
the  joint  convergence 

(N-'QW^YWit))  =^;v  Y(t))-  (6.26) 

Next  the  continuity  of  the  mappings  /  implies  that  of  (x,  y)  — >  (1  —  /(x)),I>(^y^  on 
M+  x  y,  so  that 

(ZrWWW)  (Z(«),Y(f))  (6.27) 
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by  the  Continuous  Mapping  Theorem  [29,  Thm.  5.29,  p.  150]  with 


Consider  (6.3)  for  any  bounded  arbitrary  mapping  g  :  y  — >  M,  and  recall  that 
the  mapping  Fg  defined  by  (6.2)  is  continuous  on  [0, 1]  x  y.  Consequently,  the 
Continuous  Mapping  Theorem  can  again  be  invoked  to  yield 

Fg(z\N\t),Y^\t))  Fg(Z(t),Y(t)),  (6.28) 

whence 

lim  E^Zy'W.YrW)]  =  E[F„(Z(t),Y(t))]  (6.29) 

iv— >00  L  J 

by  the  Bounded  Convergence  Theorem  [29,  Thm.  4.16,  p.  108].  Combining  (6.3) 
and  (6.29)  we  get 

Hm  E  [9(Y<k1(«))]  =  E[FS(Z(«),Y(«))]  (6.30) 

N—kx)  L  J 

and  the  bounded  mapping  g  being  arbitrary,  it  follows  immediately  that 

Yi^t+l)  =$■  jv  Y(t+1) 

for  some  y~ valued  rv  Y  (t  +  1)  characterized  by 

Efa(Y(i+l))]  =  E  [Fg  (Z(t),Y(t))] .  (6.31) 

□ 

Lemma  3.  Under  (Gl),  (G3),  if  [A:t]  and  [D:t]  hold  for  some  t  =  0, 1, . . .,  then 
[A:t+1]  also  holds. 
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Proof.  From  [A:t]  and  [D:t]  (in  particular  (6.9)),  we  conclude  that 

At  9(t)  -  C  +  E  [4(f)]  (6.32) 


and  the  desired  result  is  a  simple  consequence  of  the  continuity  of  the  function 
x  — >  x+  since 


Q^(t+  1) 
N 


QW(t) 


N 


C 


EilA 


(N) 


(t) 


for  all  N  =  1,  2, . . ..  Therefore, 


q(t  +  l)  =  [q(t)-C  +  E[A(t)]]  +  .  (6.33) 

Also  ^  A1  9(0  from  [Art] ,  then  it  is  simple  to  see  that 

QW(i  +  l)  „  ,#">(«)  ,  «<"»(( +  1) 

- V -  =  (1-“)^^  +  a - N - 

~^n  (1  -  a)q(t)  +  aq(t  +  1) 

=  q(t  +  1).  (6.34) 

□ 


Lemma  4.  Under  (G1)-(G3),  if  [A:t],  [B:t]  and  [C:t]  hold  for  some  t  —  0, 1, . . ., 
then  [C:t+1]  also  holds. 


Proof.  We  first  observe  from  (6.2)  that  for  a  fixed  N,  the  rvs  Y [N\t  +  l),i  = 

1,...,N  are  coupled  only  through  the  marking  probability  which  depends  only 

on  Q(N\t).  Fix  a  positive  integer  I.  The  rvs  Y[N\t  +  1), . . .  ,YjN>)  ( t  +  1)  are 

mutually  independent  given  fFt.  Consequently,  for  arbitrary  bounded  mappings 

9i,  ■  ■  ■  ,9i  :  y  R,  we  get 

f  1  1  / 


E 


ll9-(Y.OT(«  +  i))in 


He  Si(Y,w(t  +  i))|^ 


i=  1 


i= 1 
I 


i=  1 
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with  the  help  of  (G3). 

Now  it  follows  from  (6.8)  in  [C:t]  that  the  joint  convergence 

(vr>  Y«n)W)  (Y  1(t),...,YI(t)) 

holds  where  limiting  rvs  Yi(t) . . . ,  Y i(t)  are  i.i.d.  random  vectors  each  distributed 
according  to  Y (t).  As  in  the  proof  of  Lemma  2,  the  arguments  leading  to  the 
convergence  (6.28)  also  lead  to 

(cpzrw,  Yf’W), . .  .,Fm(z\N\t),  Yf  >(())) 

(F91(Z,(i),  Y,(i)),  ....  Fgi(Zj(t),  Y  ,(*)) 

where  the  limiting  rvs  (Z1(t),Y1(t)),...,(Z/(t),Y/(t))  are  i.i.d.  rvs  each  dis¬ 
tributed  according  to  the  pair  (Z(t),Y(t)).  Therefore,  by  the  Bounded  Conver¬ 
gence  Theorem, 

~  i  j  r  / 

lira  E  n*(Yf)((+1))  =  lira  E  nF*(Zf,M>Yf'l(f)) 

iv— >00  A  A  iv— >00  A 

_i= 1  J  li=l 

"  I 

=  e 

j= i 
i 

=  IIE1F*(Z‘W-Y‘W)] 

2=1 

I 

=  IIE[»(Y‘(t)))  <6-35) 

i=i 

where  the  last  equality  made  use  of  the  relation  (6.31).  The  desired  result  [C:t+1] 
now  follows  from  (6.35)  given  that  the  mappings  gi, ... ,  <?/  are  arbitrary.  □ 

Lemma  5.  Under  (G1)-(G3),  if  [A:t],  [B:t]  and  [C:t]  hold  for  some  t  —  0, 1, . . ., 
then  [D:t]  holds. 
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Proof.  Pick  a  bounded  mapping  g  :  y  — >  R.  Under  (G2)  the  rvs  Y [N\t),  i  = 
1, . . . ,  N,  are  exchangeable,  so  that 

var 

N 

=  Ar-2^var[9(Yf,(«))] 

i= 1 
N 

+  N-2  J2  cov[9(Y,(,v)(«)), <,(¥<">(())] 

*,j=M7b 

=  7V-1var[^(YjJV)(i))]  +  ^-j^coy[g(Y{N\t)),g(YiN\t))}.  (6.36) 

Now  let  IV  go  to  infinity  in  (6.36):  The  validity  of  [C:t]  and  the  Bounded 
Convergence  Theorem  already  imply 

lim  cov[g(Y{N\t)),g(Y{2N\t))]  =  cov[^(Yi(t)), g(Y2(t))}  =  0  (6.37) 

iv— >00 

by  asymptotic  independence.  On  the  other  hand, 

lim  sup  var[p(Y^(t))]  <  oo 

AT— >oo 

since  g  is  bounded. 

Combining  these  observations  we  readily  see  that 

lim  var  EiI1^(Y-JV)(i))l  =0, 

TV — >oo  L 

whence,  by  Chebyshev’s  inequality, 


N 


EZMyPW)  -  e  [±ZZiS(YP(t))' 


(6.38) 


This  last  convergence  is  equivalent  to 

wZZ.iS(Y.m(t))  -  E  [.(Yrb))]  -G  0 


by  exchangeability,  and  the  desired  convergence  result  (6.7)  is  now  immediate  once 
we  remark  under  [B:t]  that  lim^  E  ^(Y^t))]  =  E  [ff(Y(t))J.  □ 
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We  now  conclude  with  a  proof  of  Theorem  6:  We  first  note  that  under 
(G1)-(G3)  the  statements  [A:t]  [D:t]  trivially  hold  for  t  =  0.  Moreover,  if 
[A:t]-[C:t]  hold  for  some  t  =  0, 1, . . then  so  do  the  statements  [D:t],  [B:t+1], 
[A:t+1]  and  [C:t+1]  by  Lemma  5,  Lemma  2,  Lemma  3  and  Lemma  4, 
respectively.  Consequently,  the  statements  [A:t]-[D:t]  do  hold  for  all  t  —  0, 1, . . . 
by  induction  and  the  validity  of  Claims  (i)-(iii)  of  Theorem  6  is  now  established. 

6.5  A  Note  on  the  Distributional  Recursions 

The  proof  of  Lemma  3  also  shows  that 

^  ~*N  ?(t  + 1) 

and 

««  +  i) 

with  non-random  q(t  +  1)  and  q(t  +  1)  determined  by  (6.33)  and  (6.34), 
respectively.  Therefore,  the  asymptotic  queue  recursions  (4.9)  in  Chapter  4  and 
(5.22)-(5.23)  in  Chapter  5  hold. 

A  moment  of  reflection  and  a  comparison  to  the  decomposition  in 
(6.16)-(6.17)  and  the  analysis  leading  up  to  (6.31)  in  Lemma  2  will  convince  the 
reader  that  the  distributional  recursions  (4.10)-(4.11)  in  Theorem  2  hold. 
Similarly,  by  using  the  decomposition  (6.24)-(6.25),  we  establish  the 
distributional  recursion  (5.24)  in  Theorem  4. 
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Chapter  7 


A  Proof  of  Central  Limit  Theorem 


In  this  chapter,  we  present  the  proof  of  the  Central  Limit  Theorem  for  the 
window-based  model  in  Chapter  4,  i.e.,  Theorem  3.  As  in  the  proof  of  Theorem 
6,  we  shall  proceed  by  induction  on  t  with  the  help  of  a  series  of  technical  facts. 
Again  the  discussion  is  facilitated  by  introducing  for  each  t  —  0, 1, . . a  number 
of  auxiliary  convergence  statements. 

Throughout  the  discussion,  for  each  t  —  0, 1, . . .,  we  shall  find  it  useful  to  write 
Z(t)  =  (1  -  =  7 («)'V(,)  (7.1) 

with 

7(f)  =  1  -  /(?(*))■  (7-2) 

Also  we  note  a  simple  but  useful  consequence  of  Claim  (i)  of  Theorem  2, 
namely  that 

/( AC  !(q{t))  (7-3) 

by  the  continuity  of  /. 

With  the  aim  to  simplify  the  presentation,  for  arbitrary  integer  p  and 
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arbitrary  mapping  g  :  IN  — >  Mp,  we  define 

Lf\t)  :=  i  f  -  E  [g(H/(i))]j  ,  N  =  1,2,... 

for  each  t  —  0, 1, . . ..  Note  that  L^N\t)  corresponds  to  the  choice  g[w)  =  w. 

The  discussion  is  facilitated  by  introducing  for  each  t  —  0, 1, . . a  number  of 
auxiliary  convergence  statements,  namely  [E:t]  and  [F:t]  where 

[E:t]  For  arbitrary  integer  p  and  arbitrary  mapping  g  :  IN  — >  there  exists  an 

W,+ 1  -valued  rv  (L0(t),  Lg(t))  such  that  the  joint  convergence 

VN{L^(t),L^(t))  (L„(t),Lg(t))  (7.4) 

takes  place; 

[F:t]  For  arbitrary  integer  p  and  arbitrary  mapping  g  :  IN  —>  there  exists  an 
Mp+1-valued  rv  (L0(t  +  1),  Lg(t ))  such  that  the  joint  convergence 

VN(L{0N\t  +  l),L^\t))  (L0(t+l),Lg(t))  (7.5) 

takes  place  with  the  rv  L0(t  +  1)  related  to  L0(t)  through  the  distributional 
recurrence  (4.26). 

The  convergence  statements  propagate  in  time  as  the  discussion  now  shows: 

Proposition  2.  Under  (AWlh)-(AW2) ,  if  [E:t]  holds  for  some  t  =  0, 1, . . .,  then 
[F:t]  holds  with  M-valued  rv  L0(t+ 1)  satisfying  the  distributional  recurrence  (4.26). 

Proposition  3.  Under  (AWlb)-(AW2) ,  if  [E:t]  holds  for  some  t  —  0, 1, . . .,  then 
[E:t+1]  also  holds. 
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In  the  process  of  establishing  Proposition  3,  we  will  obtain  the  following  fact: 
For  arbitrary  mapping  g  :  IN  — >  Rp,  the  limiting  rv  Lg(t  +  1)  has  the  following 
form 

Lg(t  +  1)  =st  Lgt(t )  +  /  ( q(t))Rg(t)Lo(t )  +  Hg(t  +  l)  (7-6) 

where  Rg(t)  is  introduced  at  (7.26)  and  the  Revalued  rv  Hg(t  +  1)  is  a  zero-mean 
Gaussian  rv  with  covariance  matrix 

a j(t  +  1)  :=  E  \g(W(t))g(W(t))'Z(t)(l  -  Z(t ))]  (7.7) 

and  this  Gaussian  rv  is  independent  of  the  rvs  Lgt(t )  and  L0(t).  The  mappings  g 
and  gt  are  defined  at  (7.15)  and  (7.17),  respectively. 

We  complete  the  proof  of  Theorem  3  by  an  easy  induction  argument:  For 
t  —  0,  [E:t]  trivially  holds  since  for  each  N  —  1,  2, . . we  have  (0)  =  W  for 
all  i  =  1, . . . ,  N  and  W{ 0)  =  W .  It  is  now  plain  from  Proposition  2  and 
Proposition  3  that  [E:t]  and  [F:t]  both  hold  for  all  t  —  0, 1, . . .,  and  Theorem  3  is 
established. 


7.1  A  Proof  of  Proposition  2 

Fix  7  =  0,1,...  and  N  =  1,  2, . . ..  We  begin  by  noting  that  under  [E:t],  the 
mapping  g  being  arbitrary,  it  is  the  case  that  there  exists  an  Rp+2-valued  rv 
(L0(t),  L(t ),  Lg(t ))  such  that  the  joint  convergence 

^/N{L^\t),m{t),Lf\t))  => at  (L0(t),L(t),Lg(t))  (7.8) 

takes  place.  Indeed  it  suffices  to  use  [E:t]  with  the  mapping  w  — »■  (■ w,g(w )). 

As  we  seek  to  identify  Lo(t  +  1),  we  rewrite  the  limiting  recursion  (4.9)  in  the 
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form 


?(*+!)  =  («(*)  -  C  +  E  [W'(i)])+  =  (-K(t))*  (7,9) 

with  K(t)  given  by  (4.27).  Combining  this  observation  with  the  queue  dynamics 
(4.2),  we  get 


N 


Li"\t+  1)  = 


Q{N)(t)  n  ,  1 


+ 


N 


■*■(<))■ 


AT 


max  (  4N)(t)  +  ±  J2  W,fN\t)  -  E  [W(t)} ,  A'(i)  )  -  K(tY 


r(N) 


N 


i=  1 


=  max  ^Lo^(i)  +  L^N\t),  K(t )  j  —  Jl  (t)  + 


(7.10) 


so  that 


V^VX^t  +  l)  (7.11) 

=  max  (Vw  (^(t)  +  Z(A°(*))  ,  y/NK(tj)  -  VNK(t)+. 

Invoking  the  Continuous  Mapping  Theorem,  we  conclude  from  (7.8)  that 

VN  (. L{0N\t )  +  Iw(i))  A0(t)  +  Z(t).  (7.12) 

Three  cases  emerge  depending  on  the  sign  of  K(t).  If  K(t)  =  0,  then  (7.11) 
reduces  to 

VNL{0N\t+l)  =  (Vn  (lW(t)  +  LW(tj))+  (7.13) 

again  by  the  Continuous  Mapping  Theorem  and  the  convergence  (7.12)  yields 

VNL{0N\t  +  1)  =y- N  (L0(t)  +  L(t))+  . 

If  K(t)  <  0,  then  (7.11)  reduces  to 

VNL^\t  +  1)  =  max  (VN  (. L{0N\t )  +  LW(t))  ,  -VNlKit)^ 
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and  the  convergence  (7.12)  yields 


Vnl[  \t  +  1)  =^jv  Lq  (t)  +  L(t ) 

since  |A'(f)|  >  0  guarantees  linpv-»oo  \/N\  K(t)  \  =  oo. 
Finally,  if  K(t)  >  0,  then  (7.11)  reduces  to 


VNL^\t+  1)  =  max  (v^v  +  iw(l))  -  V^VAT(i),o) 

and  the  convergence  (7.12)  yields  VNL^\t  +  1)  0  since 

limjv-oo  y/N K (t)  =  oo.  This  completes  the  proof  of  Proposition  2. 


□ 


7.2  A  Key  Decomposition 

To  establish  Proposition  3,  we  start  with  an  arbitrary  mapping  g  :  IN  — >  Mp  for 
some  positive  integer  p,  and  define  the  mappings  g*,g *  :  IN  — >  by  (6.10)  and 
(6.11),  respectively. 

Fix  N  —  1,  2, . . i  —  1, . . . ,  N  and  t  —  0, 1, . . ..  Making  use  of  (4.5)  and 
(4.10),  we  get 

g(WlN\t  +  1))  -  E  [g(W(t  +  1))] 

=  M\N\t  +  1  )<f (H'P'W)  +  (1  -  Aff  >(*  +  1  ))9.(W™(t)) 

-E  [M(i  +  1  )g*(W(t))  +  (1  -  M(t  +  l))j„(H'(())] 

=  g.(W,m(t))-E[g.(W(t))} 

+  (s'WfHt))  -  s.fwf  >(*)))  +  1) 

-E  {(g*(W(t))  -  gt(wm  M(t  +  1)) .  (7.14) 
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Therefore,  introducing  the  mapping  g  :  IN  — »  defined  by 


g(w)  =  g*(w)i¥-  g*(w),  w  G  IN  (7.15) 

we  obtain  the  decomposition 

9(H',(N)(*+1))-E[9(H''(*+1))] 

=  9.(»f°W)-E  [<;,(»-(«))] 

+  s(Wim(t))  (Mf  >(i  +  1)  -  zf »(«)) 

+  g(wlN\t))  (z\N\t)  -  7(«)w'‘",|,)) 

+  SW™  Wb«H'‘<''>(‘)  -  E  \s{W{t))Z(t)] .  (7.16) 

Finally,  defining  the  mapping  gt  :  IN  — >  by 

9t(w)  ■■=  g*{w)  +g(w)  •  y(f)w,  w  e  IN  (7.17) 

we  find 

J(llfv|(«  +  l))-E[9(W-(«+l))] 

=  g,(W,m(t))-E\gt(Wm 

+  g(w^m(t))  (M[N\t  + 1)  -  zf°(t)) 

+  9(W, im(t))  (zf  >(t)  -  .  (7.18) 

This  decomposition  forms  the  basis  for  the  subsequent  analysis.  The  subsequent 
sections  discuss  the  needed  asymptotics  for  each  of  the  three  terms  of  (7.18). 
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7.3  The  First  Piece  of  the  Puzzle 


To  study  the  contribution  of  the  second  term  of  (7.18),  we  proceed  as  follows;  For 
any  mapping  g  :  IN  — »  Rp,  set 

i  N 

Hf\t  +  1)  :=  i  £9(  wjN)(t))  (ilff  >(*  +  1)  -  Zf\i))  ,  N  =  1,2,... 

i= 1 

for  each  t  —  0, 1, . . ..  We  begin  with  the  case  p  —  1. 

Proposition  4.  Assume  (AWlb)-(AW2)  to  hold  and  consider  an  arbitrary  map¬ 
ping  g  :  IN  — >  M.  Then,  for  each  t  =  0,1,...,  it  holds  that 

E  exp  (jdVNH(gN\t  +  1))  \Tt  e~^t+1\  9  eR  (7.19) 

with 

as(t+l)  :=  E  [g(W(t))2Z(t)(  1  -  Z( ())]  .  (7.20) 

The  proof  of  Proposition  4  is  given  in  Section  7.8.  By  using  the  standard 
Cramer- Wold  device  [6,  Thm.  7.7,  p.  49]  we  obtain  the  following  analog  in  higher 
dimensions: 

Corollary  1.  Assume  ( AWlb)-(AW2 )  to  hold  and  consider  an  arbitrary  mapping 
g  :  IN  — ■>  Then,  for  each  t  =  0, 1, . . .,  it  holds  that 

E  exp  (jO'VNH(gN\t  +  1))  \Tt  9  e  Rp  (7.21) 

with 

Tg(t  +  1)  :=  E  [g(W(t))g(W(t)YZ(t)(  1  -  Z(t))] .  (7.22) 
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We  conclude  with  the  following  crucial  by-products:  For  some  t  =  0, 1, . . . , 
consider  the  situation  where  a  sequence  of  Revalued  rvs  {A^N\t),  N  =  1,  2, . . .} 
weakly  converges,  say 

A{N\t)^NA(t)  (7.23) 

for  some  limiting  Revalued  rv  A(f).  If  for  each  N  —  1,  2, . . the  rv  A ^N\t)  is 
measurable,  then  Corollary  1  readily  implies  [6,  Thm.  3.2,  p.  21]  that 

(VNH(gN\t  +  l),MN\t))  (Hg(t  +  l),A(t))  (7.24) 

for  some  zero-mean  Gaussian  rv  Hg(t  +  1)  with  covariance  matrix  T,g(t  +  1).  The 
rv  Hg(t  +  1)  is  taken  to  be  independent  of  the  rv  A(f). 

7.4  The  Delta  Method 

We  begin  by  recalling  a  well-known  byproduct  of  the  Central  Limit  Theorem, 
known  as  the  Delta  Method  [29,  p.  214],  Its  statement  is  given  as  Proposition  1 
in  Chapter  4.  The  proof  is  provided  in  Section  7.6  as  it  is  crucial  for  establishing 
the  joint  convergence  statement  at  the  end  of  this  section. 

We  are  now  in  a  position  to  handle  the  contributions  of  the  last  term  of  the 
decomposition  (7.18).  For  any  mapping  g  :  IN  — >  Rp,  set 

i  N 

Zf’w  =-  £9(WC  >«)  -  W)wr{,,)  .  N  =1,2,... 

i=  1 

for  each  7  =  0,1,....  The  next  result  makes  use  of  the  Delta  Method  and  its 
proof  is  available  in  Section  7.9. 

Proposition  5.  Assume  (AWlb)-(AW2)  to  hold  and  consider  an  arbitrary  map¬ 
ping  g  :  IN  — »  Mp.  If  for  some  t  —  0, 1, . . .,  the  convergence  (4.30)  holds  with  some 
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rv  L0(t),  then  it  holds  that 


with 


sfNZ^\t)  =* N  f'(q(t))Rg(t)Lo(t) 


R„(t)  :=  E  \W(t)  (1  -  g(W(t)) 


(7.25) 

(7.26) 


A  careful  inspection  of  the  proof  of  Proposition  5  in  Section  7.9  reveals  that  a 
somewhat  stronger  statement  is  true:  For  some  t  —  0, 1, ... . . ,  consider  the 
situation  where  a  sequence  of  HP-valued  rvs  {A^N\t),  AA  =  1,2,...}  is  weakly 
convergent  as  in  (7.23)  together  with  (4.30),  i.e.,  we  have  the  joint  convergence 

(VNL^\t),  A(7V)(t))  =► N  (L0(t),A(t)).  (7.27) 

Then,  it  holds  that 

(/NZf  \t),  Aw(l))  =^;v  (f\q(t))Rg(t)L0(t),A(t)) .  (7.28) 

7.5  A  Proof  of  Proposition  3 

Fix  t  —  0, 1, . . ..  Going  back  to  the  basic  decomposition  (7.18),  we  note  that 

+  1)  =  ilf  >(«)  +  Hj\t  +  1)  +  Zf >(i)  (7.29) 

for  each  N  =  1,2,...,  where  g  and  gt  are  the  mappings  IN  — >  WLP  defined  earlier  at 
(7.15)  and  (7.17),  respectively. 

By  Corollary  1,  we  already  have 

VNHf\t  +  1)  =Tr jv  H§(t  +  1)  (7.30) 
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where  the  rv  Hg(t  +  1)  is  a  zero-mean  Gaussian  rv  with  covariance  matrix 
Hg(t  +  1).  However,  note  that  the  rvs  L^\t)  and  Z~N\t)  are  both  ^-measurable 
for  each  N  —  1,2, .. ..  Therefore,  by  the  comments  following  Corollary  1, 
Proposition  3  will  be  established  if  we  can  show  the  joint  convergence 

s/N(L^(t),zf\t))  (Ljt(t),Z„(t))  (7.31) 

for  some  Revalued  rv  {LSt(t),  Zs(t ))  with 

Zg(t)  =  f(q(t))Rg(t)L0(t).  (7.32) 

Indeed,  as  indicated  at  the  end  of  Section  7.3,  Equations  (7.30)  and  (7.31)  imply 

VN(L^\t)]Z'f\t),Hf\t  +  1))  (Ljt(t),Zj(t),Hil(t+  1))  (7.33) 

with  the  Gaussian  rv  H^(t  +  1)  independent  of  (L§t(t),  Z§(t)).  Consequently,  we 
have  the  convergence 

VNL<f\t  +  1)  =► N  Lh(t)  +  Zg(t)  +  Hg(t  +  1).  (7.34) 

Combining  (7.34)  with  (7.32)  yields  the  desired  result. 

To  establish  the  validity  of  (7.31)  and  (7.32),  we  proceed  as  follows:  Under 
[E:t],  it  is  already  the  case  that 

VN( iUW.L'fW)  (7.35) 

while  the  strengthening  (7.27)-(7.28)  (with  A ^N\t)  =  \fNLi'^\t))  of  Proposition 
5  leads  to  (7.28)  in  the  form 

yv(4f»(«),zf>(«))  ^N(LiH(t),nq(t))R3(t)Lll(t)).  (7.36) 

In  other  words,  joint  convergence  (7.31)  takes  place  with  the  identification  (7.32) 
for  the  limiting  rv. 
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7.6  A  Proof  of  Proposition  1 


Fix  t  —  0, 1, . . .  and  iV  =  1,2,....  We  start  with  the  observation  that 

Viv  (/(^W)  - /(,(())) 

=  '/N  j  {)  (f'(x)  -  f(q(t)))dx  +  Vn  ^ — — - q(t)Jf(q(t)) 

=  VNU^N\t)  +  VNL{0N\t)f(q(t))  (7.37) 


where  we  have  set 


U[N\t):=  (f'(x)  -  f'(q(t)))dx. 

Jq(t) 

The  desired  conclusion  (4.31)  will  readily  follow  if  we  show  the  convergence 


VNU{N\t)  4 ;v  o. 


(7.38) 


To  that  end,  fix  £  >  0  arbitrary.  For  any  <5  >  0,  we  have 


VN\TjW{t) 


>  £ 


<  P 


VN\uW{t)\  >  e, 
QW  (t) 


QWQk) 


N 


q(t) 


<  8 


N 


- q(t ) 


>  5 


(7.39) 


By  the  continuity  of  f  at  x  —  q(t ),  we  know  that  for  each  rj  >  0,  there  exists 
8( rj)  >  0  such  that  whenever  \x  —  q(t)\  <  8( rj)  in  M+,  we  get 


\f\x)  —  f\q{t))\  <  rj. 


Now  fix  r] 

r  Q(JV)(9  _ 

1  N 


>  0  and  pick  8  >  0  in  the  range  (0,  8(r})).  Thus,  on  the  event 
q(t)  <  4],  we  find  that 


VN\U{N\t)\  <  VNrj 


QW(t) 

N 


q(t) 
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Reporting  this  fact  into  the  inequality  (7.39)  we  obtain 


P  \^N\Uw(t)\  >  e 

<  P  VN  -  q(t)  >  -]  +  P  [  tPAl  _  q{t)  >  4  (7.40) 

JM  ?7  J  |_  A 

Letting  N  go  to  infinity  in  (7.40)  and  using  the  convergence  (4.6)  and  (4.30),  we 
get 

lim sup  P  [\7/V  |f/(Ar) (t)\  >  el  <  P  L0(t)  >  -  .  (7.41) 

TV— XX)  L  J  _  V_ 

The  desired  conclusion  (7.38)  is  now  immediate  upon  letting  r)  >  0  go  to  zero  in 
this  last  inequality  since  its  left-hand  side  is  independent  of  rj. 


7.7  Strengthening  Claim  (ii)  of  Theorem  2 

Before  turning  to  the  proof  of  Proposition  4  in  the  next  section,  we  pause  to 
present  a  result  that  builds  on  Claim  (ii)  of  Theorem  2.  This  result  will  prove 
useful  in  establishing  Proposition  4  later  on. 

Proposition  6.  Assume  (AW1)-(AW2)  to  hold.  Then,  for  each  t  —  0, 1, . . .,  and 

any  function  g  :  IN  — >  M,  it  holds  that 
1  N 

e  [a(w(t))z(tY]  (7.42) 

i= 1 

for  each  integer  i  =  1,  2, . . .. 

Proof.  Fix  t  —  0, 1, . . .  and  t  =  1,  2, . . ..  Also  fix  N  =  1,  2, . . .  and  i  = 

We  have 

9(wf V>(t))ZiKW 

=  9(wr>(t))  hi  - /  (»))"*  -  a  - 

+Su(W<m(t))  (7.43) 
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with  mapping  gt/  :  ffi.  — >  M  given  by 

9tAw)  :=  9{w)  (1  -  f(q(t))Yw  ,  w  E  IN. 

On  the  other  hand,  for  any  pair  a,  b  in  [0, 1],  we  have 

| a?  -  bp |  =p\  f  tp~ldt\  <  p\b  -  a\  (7.44) 

J  a 


(7.45) 


(7.46) 


(7.47) 


since 

1  N 

jj  V  lVj(A')(t)|9(H/f>(())|  E  \W(t)\g(W(t))\\  (7.48) 

i= 1 

by  Claim  (ii)  of  Theorem  2. 

The  conclusion  is  now  immediate  from  the  decomposition  (7.43),  the  conver¬ 
gence  (7.48)  and  the  convergence 

1  N 

i„  e (7.49) 

i= 1 

obtained  from  Claim  (ii)  of  Theorem  2.  □ 
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7.8  A  Proof  of  Proposition  4 


The  proof  of  Proposition  4  relies  on  the  following  two  technical  lemmas;  their 
proofs  are  omitted  in  the  interest  of  brevity. 

Lemma  6.  For  any  x  in  K,  the  Taylor  series  expansion 

x2 

ejx  =  1  +  jx  — —  +  R(x)  (7.50) 

holds,  with  complex-valued  remainder  term  R(x)  satisfying 

i«wi  <  Jd!.  (7.5i) 


Lemma  7.  Consider  the  array  of  complex-valued  rvs  {Cjsr,i,  i  —  1, . . . ,  N;  N  = 
1,2,...}  with  |  C jv  ,i  |  <  1  fori  =  1  If  maxj=ir..)jv  \Cn,x\  ~^n  0  a.s.  and 

Yjt=\  CN,i  Ajv  A,  then 

N 

J](l  -  CN,i)  A N  e~\  (7.52) 

i= 1 

The  proof  of  Proposition  4  can  now  proceed:  Fix  N  =  1,  2, . . .  and  6  arbitrary 
in  M.  By  conditional  independence,  we  find  that 


E 

N 


[exp  +  1) 

nE  A  (j-%  (stwf^w)  (My  it + 1)  -  zy '(<))) 


i=  1 
N 


I  Tt 


i=i 


nti-cfb) 


(7.53) 


with 


cy\t)  =  zf'(i) 

+(1-Zf>(()) 


1  -  exp  (j-f=g(Wy\t))(  1  -  Z\N\t)) 

1  -  exp  (-j-Fglwy\t))zyit) 


(7.54) 
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for  each  i  —  1, . . . ,  N . 

In  view  of  these  remarks,  the  desired  result  (7.19)  is  now  a  simple  consequence 
of  Lemma  7  provided  the  required  conditions  can  be  shown  hold,  namely 

lim  max  |C,,-iV)(t)|  =  0  a.s.  (7.55) 

N  i=l,...,JV  *  Wl  v  ' 

and 

N 

£cf’(«)LWi+  1)  (7.56) 

i= 1 

with  <7g(t  +  1)  given  by  (7.20). 

Condition  (7.55)  trivially  holds.  To  establish  (7.56)  we  invoke  Lemma  6  to 
write 

Cf°M  =  +  (1  -  Z,W(t))Bgp(t)  (7.57) 

with 

-  zf  >(«))2  +  /?<")((;  0)  (7.58) 

and 

=  j-Lg(H7OT(i))zf)(«) 

+^9(irjW(t))2zy>(t)2  +  /?<")((;  B)  (7.59) 

where  remainder  terms  (3^  )  (t;  9)  and  f3 ^  (t;  9)  in  these  expressions  satisfy 

l/CM)UA?M)l^c^p 

for  some  positive  constant  C,  say  C  :=  max{|g(w)|3,  w  =  1, ... ,  Wnmx } . 

Reporting  (7.58)  and  (7.59)  into  (7.57)  and  simplifying  the  resulting 
expression,  we  find 

cf>(t)  =  ^s(W^<iV)(*))2Zj'V)(()(l  -  zf\t))  +  7iN>(t;  B)  (7.60) 


135 


with  remainder  term 


7 f ’(*; »)  :=  «)  +  (!- zfHmTAt',  9). 

Obviously, 

l7f)(«;9)|  <C^L.  (7.61) 

It  is  now  plain  that 

Ecf'W  =  |  (^E</(»r,l',)(i))2d',)(i)(l  -  dN)W))  +r«A')(i)  (7.62) 

1=1  \iV  7=1  / 

where  we  have  set 

1  N 

rmW  =  ^E7,OT(«;9). 

2=1 

By  Proposition  6  (with  t  =  1  and  £  =  2)  we  get 

i  N 

^Y,^WiN\t)fz\N)m-Z\N\t))  4 „  a9(f+l)  (7.63) 

2=1 

with  crfl  (t  +  1)  given  by  (7.20),  while  lim;v-_).oor(JV)(£)  =  0  a.s.  by  virtue  of  (7.61). 
The  desired  conclusion  (7.56)  is  obtained  and  the  proof  of  Proposition  4  is 
complete. 


7.9  A  Proof  of  Proposition  5 

Fix  iV  =  1,2,...  and  i  —  1, . . . ,  N.  Observe  that 

7(iv)(i)wiWW_7W^)(t) 

rf(Q(Nht)) 

=  WlN\t)  N  (l-^PP)-1  dy 
=  W,iN\t)Af\t) 

+  W,fN\t)  (/(ShlW)  _  /(,(t))j  (1  _  /(S(t)))w,.'">«)- ■  (7.64) 
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where  we  have  set 


f(Q(JV)W) 

A f\t)  :=  f 

'(i  -  yjd"’"'-1  -  (1  -  /(9(t)))<°e>-'' 

df(q(t)) 

L  J 

Consequently, 

+  ■  w(/(^>  -  /(«(*») 

(7.65) 


where  the  mapping  g*  :  IN  — »•  Mp  is  defined  by 

9t(w)  '■=  w#0)  (i  -  f(q(t)))w~1 ,  wen. 


Using  the  inequality  (7.44),  we  find  that 
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Let  A^  go  to  infinity.  Coupling  the  convergence  (7.3)  with  Proposition  1  we  find 
that 
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It  now  follows  by  Claim  (ii)  of  Theorem  2  (applied  to  the  mapping  w  — »  w\g(w)\) 


that 

1  N 

^Y,9(W,m(t))W,{N>(t)Af\t)  i„0.  (7.67) 

iV  i  1 

Thus,  in  view  of  the  decomposition  (7.64)  (with  (7.65))  and  of  the 
convergence  (7.67),  it  is  now  plain  that  the  convergence  (7.25)  will  hold  provided 
we  can  show  that 

vW  (/(2^W) -  /(,(()))  =S-W  f'(q(t))R„(t)L0(t). 

(7.68) 

However, 

i  N 

i=l 

and  (7.68)  follows  by  applying  Proposition  1. 
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Chapter  8 


Conclusions  and  Future  Directions 

8.1  Conclusions 

The  main  focus  of  this  dissertation  is  on  characterizing  the  dynamics  of  the  TCP 
congestion-control  mechanism  and  AQM  (in  particular  RED)  in  the  regime  where 
the  number  of  competing  flows  is  large.  The  systems  we  considered  can  be 
thought  of  as  stochastic  feedback  systems.  The  first  system  we  considered  is  the 
rate-based  model  in  Chapter  3  and  the  Weak  Law  of  Large  Numbers  (Theorem 
1)  is  established  for  the  model.  It  reveals  the  natural  simplification  of  the  system 
dynamics.  In  fact,  Theorem  1  indicates  that  the  asymptotic  queue  follows  a 
simple  deterministic  recursion  which  involves  only  the  expectation  of  the  limiting 
number  of  packets  injected  into  the  network  in  a  timeslot.  The  recursion  for  the 
limiting  number  of  packets  injected  into  the  network  is  also  closely  related  to  the 
recursion  of  a  single  traffic  flow,  i.e.,  at  any  time  they  both  have  the  same 
conditional  expectation  given  the  same  state  in  the  previous  timeslot  and 
marking/ dropping  probability. 

While  the  model  is  subsequently  refined  to  be  more  realistic  in  Chapter  4-5, 
similar  results  can  also  be  established.  These  Weak  Laws  of  Large  Numbers 
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therefore  provide  justification  for  the  analysis  of  the  TCP/AQM  interaction 
through  a  deterministic  feedback  system  when  there  exists  a  large  number  of 
flows.  More  specifically,  the  recursion  of  the  asymptotic  (or  average)  queue, 
i.e.,  q(t),  depends  only  on  the  capacity  of  the  queue  and  the  expected  amount  of 
traffic  injected  into  the  network  in  each  timeslot,  i.e.,  E  [A(f)].  This  is  similar  to 
many  of  the  existing  models  which  analyze  the  dynamics  of  TCP  and  AQM 
mechanisms  as  deterministic  feedback-delay  systems. 

Following  the  aforementioned  observation,  this  research  complements  the 
control-theoretic  studies  of  TCP/AQM  dynamics.  Much  of  the  research  effort 
along  this  direction  has  been  focused  on  either  the  locally  or  globally  stability  of 
the  controlled  queue  (as  surveyed  in  Section  2.2).  These  work  reveal  sufficient 
conditions  for  the  queue  to  either  be  local  or  global  asymptotically  stable, 
i.e.,  the  queue  eventually  settles  to  an  equilibrium  value.  Under  these  sufficient 
conditions,  Assumption  (AW4),  i.e.,  (q(t),Y(t))  =^jv  (<?*,  Y*),  is  reasonable.  In 
other  words,  the  average  behavior  of  such  complex  stochastic  feedback  system  at 
the  equilibrium  becomes  easy  to  specify  through  steady-state  analysis  given  that 
there  is  a  large  number  of  flows  and  the  system  parameters  are  set  appropriately 
using  the  sufficient  conditions  from  the  control-theoretic  analysis. 

The  system  in  the  large  number  of  flows  asymptotic  regime  at  the 
steady-state  is  simple  to  analyze  because  of  the  behavior  of  any  single  flow  no 
longer  influences  the  network  behavior.  Also,  the  finding  that  the  flows  become 
asymptotically  independent  also  supports  the  belief  that  RED  breaks  global 
synchronization  with  a  large  number  of  flows. 

As  expected,  the  natural  simplification  of  the  models  and  these  asymptotic 
properties  can  be  found  in  the  regime  where  the  number  of  interacting  flows  is 
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large  without  having  to  rely  on  ad-hoc  assumptions  during  the  analysis. 
Furthermore,  the  limiting  models  are  also  compatible  with  other  previously 
proposed  models  in  their  respective  regime,  thereby  revealing  the  versatility  of 
the  limiting  models. 

While  the  Weak  Laws  of  Large  Numbers  capture  the  average  behavior  of  the 
systems,  the  Central  Limit  analysis  capture  the  errors/uncertainties  due  to  the 
randomness  in  the  system.  These  uncertainties  appear  as  fluctuations  in  the 
queue  and  are  not  represented  in  the  deterministic  models. 

It  is  noteworthy  from  the  Central  Limit  analysis  that  the  random  marking 
mechanism  in  AQM  always  introduces  random  fluctuations  in  the  queue.  This  is 
an  intrinsic  behavior  for  the  random  marking  mechanism  due  to  the  limited 
granularity  in  the  feedback  information.  As  noted  in  Chapter  5,  such  a 
fluctuation  can  be  reduced  by  improving  the  quality  of  the  feedback  information 
either  through  an  increase  in  the  number  of  ECN  bits  or  through  an  in-band 
signaling  mechanism  as  suggested  in  [56].  There  are,  however,  other  means  to 
avoid  such  fluctuations.  For  example,  TCP  Vegas  [9]  and  its  variants  such  as 
TCP  FAST  [27]  use  delay  information  instead  of  marks  to  adjust  their  congestion 
windows.  Given  an  accurate  timestamp  in  the  packets,  each  flow  can  accurately 
calculate  the  appropriate  adjustment  to  its  congestion  window  size,  thus  greatly 
reducing  uncertainty  from  the  randomness  in  the  marks. 

Although  the  fluctuations  due  to  randomness  can  be  well-approximated 
through  the  CLT,  the  feedback  system  in  the  deterministic  (or  limiting)  models 
also  introduces  fluctuations  (or  oscillations)  as  classical  control-theoretic  analysis 
indicates.  The  random  fluctuations  derived  in  the  Central  Limit  analysis 
complement  these  oscillations.  Moreover,  some  parameters  in  the  system  can  also 
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induce  fluctuations  in  both  components,  e.g.,  the  steep  slope  of  the  marking 
probability  function  not  only  causes  fluctuation  in  the  random  components  as 
suggested  by  the  Delta  Method  but  also  induces  oscillations  as  the  feedback  gain 
being  large  in  the  deterministic  model. 

Overall,  this  Ph.D.  dissertation  provides  a  novel  framework  for  characterizing 
the  asymptotic  dynamics  of  the  network  containing  many  flows  where  flows 
interacts  with  each  other  through  the  random  feedback  signaling  from  the 
network.  The  results  obtained  are  both  compatible  with  and  complementary  to 
the  existing  literature.  The  approach  used  here  can  also  be  generalized  to  other 
weakly  interacting  stochastic  feedback  systems. 

8.2  Future  Directions 

While  this  thesis  studies  in  depth  the  problem  of  characterizing  the  dynamics  of 
the  TCP  congestion-control  mechanism  and  AQM  schemes,  there  remain  several 
open  problems  for  research  as  outlined  below. 

Uncontrolled  cross  traffic 

First,  consider  the  situation  where  the  bottleneck  gateway  has  uncontrolled  cross 
traffic,  i.e.,  the  amount  of  traffic  injected  into  the  network  is  unresponsive  to  the 
network  congestion,  then  the  overall  number  of  packets  arriving  at  the  bottleneck 
gateway  will  be  a  combination  from  both  TCP  flows  and  uncontrolled  flows.  If 
the  queue  is  small  and  the  amount  of  uncontrolled  cross  traffic  is  small 
comparing  to  the  capacity  (as  it  should  in  any  stable  network),  then  a  reasonable 
approximation  is  to  reduce  the  capacity  of  the  bottleneck  gateway  equal  to  the 
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number  of  packets  from  the  uncontrolled  flows  in  each  timeslot  in  the  model.  The 
only  difference  in  the  modified  model  from  the  models  considered  in  this  thesis  is 
that  the  available  capacity  in  the  bottleneck  gateway  varies  as  a  function  of  time. 
In  [52],  a  discrete-time  traffic  model  such  as  M/G/oo  traffic  arrival  process  has 
been  found  suitable  for  modeling  the  uncontrolled  cross  traffic  comprising  of 
many  uncontrolled  flows. 

Multiple  bottleneck 

While  recent  work  has  shown  that  a  single  bottleneck  model  is  a  reasonable 
approximation  for  an  analysis  of  open-loop  or  uncontrolled  flows  in  a  multi-stage 
network  of  queues  [12],  it  is  not  clear  that  this  would  be  the  case  for 
feedback-based  flows  such  as  TCP.  Studies  have  shown  that  a  large  number  of 
TCP  flows  over  multiple  bottleneck  queue  utilizing  Tail  Drop  gateway  exhibits 
very  complex  behavior  [3].  It  remains  an  open  problem  on  how  a  collection  of 
TCP  flows  will  behave  in  a  general  network  setting.  Deterministic  models  of 
congestion-control  utilizing  an  optimization  framework  are  proposed  in  [30,  37].  It 
merits  an  investigation  as  to  whether  these  deterministic  models  can  be  justified 
under  a  large  number  of  flows  asymptotic  similar  to  the  single  bottleneck  case. 

Wireless  physical  layer 

In  all  of  the  models  considered  in  this  thesis,  the  only  point  of  interaction 
between  flows  are  at  the  bottleneck  gateway.  However,  if  flows  also  share  a 
common  wireless  physical  layer  (such  as  the  third  generation  wireless  data 
networks  or  ad-hoc  networks),  then  flows  can  also  interact  in  the  physical/MAC 
layer  as  well.  For  example,  the  flows  utilizing  the  downlink  of  the  cdma2000 
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lxEV-DO  (IS-856)  wireless  cellular  network  will  interact  through  the  proportional 
fair  scheduler  and  their  uplink  packets  (e.g.,  ACK  packets)  will  interact  through 
interference  in  the  physical  layer  and  its  MAC  layer  mechanism  [62],  The  effect  of 
this  added  layer  of  interaction  to  the  system  is  difficult  to  analyze  and  quantify 
using  traditional  modeling  techniques.  Nevertheless,  we  can  expect  that  model 
simplification  will  occur  in  the  large  number  of  flows  asymptotic  regime.  This 
might  shed  some  insights  on  the  complex  behavior  of  the  system. 

Selection  of  the  feedback  function 

As  previously  mentioned  in  the  discussion  on  the  CLT  in  Chapter  5,  the  selection 
of  the  appropriate  feedback  mechanism  can  now  be  evaluated  systematically 
through  the  limiting  models. 

Extension  to  other  congestion  controllers  and  AQM  mechanisms 

Although  the  modeling  efforts  in  this  thesis  are  concentrated  on  TCP/RED 
which  are  the  dominant  combination  of  the  existing  congestion-control  and  AQM 
mechanism,  it  is  easy  to  generalize  the  approach  to  other  congestion  controllers 
and  AQM  mechanisms  (see  [58]  for  an  example).  We  will  briefly  describe  such 
generalizations  in  this  section. 

AQM  gateways  control  their  level  of  congestion  by  randomly  marking 
incoming  packets  to  signal  the  traffic  sources  of  the  congestion  level.  In  order  to 
do  so,  each  AQM  mechanism  calculates  a  quantity  which  can  be  referred  to  as  a 
congestion  measure.  The  congestion  measure  in  a  timeslot  can  be  a  function  of 
the  queue  sizes  (current  size  and  past  values),  of  the  volume  of  the  incoming 
traffic,  and  of  the  previous  values  of  the  congestion  measure.  We  can  then  use  a 
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continuous  mapping  to  map  this  congestion  measure  to  the  marking  probability 
in  the  AQM  mechanism.  Under  appropriate  scaling  rules,  this  marking 
probability  will  converge  under  a  large  number  of  flows. 

The  congestion  controllers  can  be  generalized  through  the  deterministic 
mappings  which  map  the  current  and  past  history  of  the  state  variables 
(e.g.,  congestion  window)  and  marks  from  the  AQM  mechanisms  to  the  state 
variables  in  the  next  timeslot.  Under  appropriate  assumptions,  the  limiting 
model  of  the  generalized  system  can  be  derived  along  the  lines  of  the  analysis 
given  in  this  dissertation. 
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